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Preface 



These are the notes of the second-year course Algebraic Structures I at Queen 
Mary, University of London, as I taught it in the second semester 2005-2006. 

After a short introductory chapter consisting mainly of reminders about such 
topics as functions, equivalence relations, matrices, polynomials and permuta- 
tions, the notes fall into two chapters, dealing with rings and groups respec- 
tively. I have chosen this order because everybody is familiar with the ring of 
integers and can appreciate what we are trying to do when we generalise its prop- 
erties; there is no well-known group to play the same role. Fairly large parts of 
the two chapters (subrings/subgroups, homomorphisms, ideals/normal subgroups, 
Isomorphism Theorems) run parallel to each other, so the results on groups serve 
as revision for the results on rings. Towards the end, the two topics diverge. In 
ring theory, we study factorisation in integral domains, and apply it to the con- 
struction of fields; in group theory we prove Cayley's Theorem and look at some 
small groups. 

The set text for the course is my own book Introduction to Algebra, Ox- 
ford University Press. I have refrained from reading the book while teaching the 
course, preferring to have another go at writing out this material. 

According to the learning outcomes for the course, a studing passing the 
course is expected to be able to do the following: 

• Give the following. Definitions of binary operations, associative, commuta- 
tive, identity element, inverses, cancellation. Proofs of uniqueness of iden- 
tity element, and of inverse. 

• Explain the following. Group, order of a group, multiplication table for a 
group, subgroups and subgroup tests, cyclic subgroups, order of an element, 
Klein four group. 

• Describe these examples: groups of units of rings, groups of symmetries of 
equilateral triangle and square. 

• Define right cosets of a group and state Lagrange's Theorem. Explain nor- 
mal subgroup, group homomorphism, kernel and image. 
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• Explain the following: ring, types of rings, subrings and subring tests, ide- 
als, unit, zero-divisor, divisibility in integral domains, ring homomorphism, 
kernel and image. 



Note: The pictures and information about mathematicians in these notes are 

taken from the St Andrews History of Mathematics website: 

http : //www-groups . dcs . st-and . ac . uk/ -hi story/ index . html 



Peter J. Cameron 
April 13, 2006 
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Chapter 1 
Introduction 



The first chapter of the notes will tell you a bit about what this subject involves, 
and then will go over material that you should be familliar with: sets, relations, 
functions; equivalence relations; matrices and polynomials; and permutations. 
A couple of reminders about notation: 

• N, Z, Q, R and C denote the natural numbers, integers, rational numbers, 
real numbers, and complex numbers respectively; 

• if A is an m x n matrix, then Aij denotes the entry in the z'th row and 7th 
column of A, for 1 <i<m and 1 < j < n. 

1.1 Abstract algebra 

Algebra is about operations on sets. You have met many operations; for example: 

• addition and multiplication of numbers; 

• modular arithmetic; 

• addition and multiplication of polynomials; 

• addition and multiplication of matrices; 

• union and intersection of sets; 

• composition of permutations. 

Many of these operations satisfy similar familiar laws. In all these cases, the 
"associative law" holds, while most (but not all!) also satisfy the "commutative 
law". 
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The name "algebra" comes from the title of 
the book Hisab al-jabr w'al-muqabala by 
Abu Ja'far Muhammad ibn Musa 
Al-Khwarizmi, a Persian mathematician who 
lived in Baghdad early in the Islamic era 
(and whose name has given us the word 
'algorithm' for a procedure to carry out some 
operation). Al-Khwarizmi was interested in 
solving various algebraic equations 
(especially quadratics), and his method 
involves applying a transformation to the 
equation to put it into a standard form for 
which the solution method is known. 
We will be concerned, not so much with solving particular equations, but gen- 
eral questions about the kinds of systems in which Al-Khwarizmi 's methods might 

apply- 
Some questions we might ask include: 

(a) We form C by adjoining to K an element i satisfying i 2 = —1, and then 
assert that the "usual laws" apply in C. How can we be sure that this is 
possible? What happens if we try to add more such elements? 

(b) What is modular arithmetic? What exactly are the objects, and how are the 
operations on them defined? Does it satisfy the "usual laws"? 

(c) What are polynomials? Do they satisfy the "usual laws"? What about ma- 
trices? 

(d) Do union and intersection of sets behave like addition and multiplication of 
numbers? What about composition of permutations? 

(e) What are the "usual laws"? What consequences do they have? 

In this course we will define and study two kinds of algebraic object: 

rings, with operations of addition and multiplication; 

groups, with just one operation (like multiplication or composition). 

Groups are in some ways simpler, having just a single operation, but rings are 
more familiar since the integers make a good prototype to think about. 
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1.2 Sets, functions, relations 

Sets Two sets are equal if and only if they have the same members. That is, 

A = B if and only if ((* G A) (xeB)). 
This means that, to prove that two sets are equal, you have to do two things: 

(i) show that any element of A lies in B; 

(ii) show that any element of B lies in A. 

Of course, (i) means that A C B (that is, A is a subset of B), while (ii) means 
B CA. So we can re-write our rule: 

ACB if and only if ((* e A) => (x E B)), 
A=B if and only if ACBandBCA. 

From two sets A and B we can build new ones: 
union: AU B = {x : x E A or x E B}; 
intersection: AHB = {x: x EA and x E B; 
difference: A\B = {x : x E A and x B}; 
symmetric difference: AAB = (A\B)U(B\A). 

Cartesian product 

If A and B are sets, their cartesian product 
A x B is the set of all ordered pairs (a,b) for 
a E A and b E B. The name commemorates 
Descartes, who showed us that we can match 
up the points of the Euclidean plane with 
K x K by using cartesian coordinates: the 
point x units east and y units north of the 
origin is matched with the pair (x,y). Then 
an equation connecting x and y describes the 
set of points on some curve in the plane: 
geometry meets algebra! 
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Functions A function / from A to B is, informally, a "black box" such that, if 
we input an element a E A, then an element f(a) E B is output. More formally, a 
function is a set of ordered pairs (that is, a subset of the cartesian product Ax B) 
such that, for any a E A, there is a unique b E B such that (a,&) G /; we write 
b = f(a) instead of (a, b) E f. 

The sets A and B are called the domain and codomain of /; its z'mage consists 
of the set 

{b E B :b = f(a) for some a E A}, 

a subset of the codomain. 
A function / is 

surjective (or onto) if, for every b E B, there is some a EA such that b = f(a) 
(that is, the image is the whole codomain); 

injective (or one-to-one) if a\ ^ ci2 implies f{a\) ^ f{a2) (two different elements 
of A cannot have the same image); 

bijective if it is both injective and surjective. 



Operations An operation is a special kind of function. 

An n-ary operation on a set A is a function / from A n = A x ■ - x A to A. 

« times 

That is, given any a\,...,a n E A, there is a unique element I? = f{a\ , ...,««) G A 
obtained by applying the operation to these elements. 

The most important cases are n = 1 and n = 2; we usually say jmary for "1- 
ary", and binary for "2-ary". We have already seen that many binary operations 
(addition, multiplication, composition) occur in algebra. 



Example Addition, multiplication, and subtraction are binary operations on 
M, defined by 

/(a, b) = a + b (addition), 

f{a,b) = ab (multiplication), 

f{a, b) = a — b (subtraction). 

Taking the negative is a unary operation: f(a) = —a. 
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Notation As the above example suggests, we often write binary operations, 
not in functional notation, but in either of two different ways: 

• infix notation, where we put a symbol for the binary operation between the 
two elements that are its input, for example a + b, a — b, a-b, a*b, aob, 
a»b;or 

• juxtaposition, where we simply put the two inputs next to each other, as ab 
(this is most usually done for multiplication). 

There are various properties that a binary relation may or may not have. Here 
are two. We say that the binary operation o on A is 

• commutative if aob = boa for all a, b G A; 

• associative if (a o b) o c = a o (b o c) for all a,b,c G A. 

For example, addition on R is commutative and associative; multiplication of 2 x 2 
matrices is associative but not commutative; and subtraction is neither. 

A binary operation * on a finite set A can be represented by an operation table, 
with rows and columns labelled by elements of A. In row a and column b we put 
a * b. Here is a small example. 



* 


a 


b 


a 


a 


b 


b 


a 


a 



Relations A binary relation R on A is a subset of A x A. If (a,b) G R, we say 
that a and b are related, otherwise they are not related, by R. 

As with operations, we often use infix notation, for example a < b, a < b, 
a = b,a = b,a~b. But note the difference: 

+ is an operation, so a + b is a member of A; 

< is a relation, so a < b is an assertion which is either true or false. 

Example Let A = { 1 , 2, 3}. Then the relation < on A consists of the pairs 

{(1,2),(1,3),(2,3)}, 

while the relation < consists of the pairs 

{(1,1), (1,2), (1,3), (2,2), (2,3), (3,3)}. 

Also like operations, there are various laws or properties that a relation may 
have. We say that the binary operation R on A is 
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• reflexive if (a, a) E R for all a E A; 

• irreflexive if (a, a) £ R for all a G A; 

• symmetric if (a,fe) G 7? implies (fe,a) G i?; 

• antisymmetric if (a, &) and (fe, a) are never both in R except possibly if a = b; 

• transitive if (a, b) E R and {b,c ER imply (a, c) E R. 

For example, < is irreflexive, antisymmetric and transitive, while < is reflexive, 
antisymmetric and transitive. 

1.3 Equivalence relations and partitions 

A binary relation R on A is an equivalence relation if it is reflexive, symmetric 
and transitive. 

A partition P of A is a collection of subsets of A having the properties 

(a) every set in P is non-empty; 

(b) for every element a E A, there is a unique set X E P such that a EX. 
The second condition says that the sets in P cover A without overlapping. 



The first important fact we meet in the course is this: 

Equivalence relations and partitions are essentially the same thing. 
Any equivalence relation on a set gives us a partition of the set, and 
any partition comes from a unique equivalence relation. 

We will state this as a theorem after the next definition. 

Let R be an equivalence relation on a set A. For any a E A, we define the 
equivalence class of a to be the set of all elements related to a: that is, 

R(a) = {bEA: (a,b) ER}. 

If we don't need to mention the name of the equivalence relation, we may denote 
the equivalence class of a by [a] . 
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Theorem 1.1 If R is an equivalence relation on a set A, then the equivalence 
classes of A form a partition of A. 

Conversely, ifP is a partition of A, then there is a unique equivalence relation 
R on A for which P is the set of equivalence classes ofR. 

Proof First, let R be an equivalence relation on A, and let P be the set of equiva- 
lence classes of R: that is, P = {R(a) :aeA}. We have to show two things. 

(a) First we show that the members of P are all non-empty. Take an equivalence 
class, say R(d). By the reflexive law, (a, a) £ R; then, by definition, a £ 
R(a). So R(a) is not empty. 

(b) Now take any element a £ A; we must show that a lies in exactly one equiv- 
alence class. From what we just did, we know that a lies in the equivalence 
class R(a) ; so we have to show that, if a lies in another class, say, R(b), then 
R(b) — R(a). Since a £ R(b), we know that (b,a) £ R. According to the 
rule for proving two sets equal, we have two things to do: 

(i) Take x £ R(a). By definition, (a,x) £ R. We know that (b,a) £ R. 
Applying the transitive law, we see that (b,x) £ R, that is, x £ R(b). So 
R(a)CR(b). 

(ii) Take x £ R(b). By definition, (b,x) £ R. We know that (b,a) £ R, 
and so by the symmetric law, (a,b) £ R. Then by the transitive law, 
(a,x) ER,soxe R(a). Thus, R(b) C R(a). 

These two arguments allow us to conclude that R(a) = R(b), and we are 
done. 











■a 
■b 



If (a, b) £ R, then R(a) = R(b) . 

Now we prove the converse. Suppose that P is a partition of A. We define a 
binary relation R on A by the rule that 

(a, b) £ R if and only if a,b EX for some set X £ P. 

We have to show that R is an equivalence relation, that is, to check the three laws. 

reflexive: Take any a £ A. Since P is a partition, a lies in some set X £ P. Then 
a, a £ X, so (a, a) £ 7? by definition. 
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symmetric: Suppose that (a,b) E R. Then a : b EX for some X E P. Since the 
order doesn't matter, it follows that (b, a) E R. 

transitive: Suppose that (a,b) G R and (b,c) G R. Then there are sets X,Y G P 
such that a,b EX,b,c EY. Now P is a partition, so Z? lies in a unique set of 
P. This means that X = Y . Now a,c EX, so (a, c) G P. 

Finally we have to show that, for this relation R, the equivalence classes are 
the sets in P. Let a G A, and let X be the unique set of P which contains a. We 
have to show that X = R(a) . As usual, there are two jobs: 

lib EX, then a,b EX, so (a,b) G P, so b G 7?(a). 

if £ G /? (a), then (a, G R, so there is some Y E P with a, G Y. But there 
is a unique set X E P containing a, so Y = X, whence b EX. 

Thus X =R(a) as required. 

Example 1 Let / : A — > 5 be a function. Define a relation 7? on A by the rule that 
(fli,a2) G P if and only if f(a\) = /(«2)- Then / is an equivalence relation. (It is 
completely straightforward to show that R is reflexive, symmetric and transitive: 
try it!) There is a bijection between the equivalence classes of R and the points in 
the image of the function /. 

For example, let A = B = { 1, 2, 3, 4, 5} and let f(x) =x 2 — 6x+ 10. Calculation 
gives 



X 


1 2 3 4 5 


m 


5 2 12 5 



So the equivalence classes of the relation R are {1,5}, {2,4}, and {3}. 

Example 2 Let n be a positive integer. Recall that two integers x and y are con- 
gruent modulo n, written x = y (mod n) , if n divides y — x. This is an equivalence 
relation: 

reflexive: n divides = x — x, sox = x (mod n). 

symmetric: Suppose that x = y (mod n), so that n divides y—x. Then n divides 
— {y — x) =x — y,soy = x (modn). 



transitive: Suppose that x = y (mod n) and y = z (mod n) . Then n divides y — x 
and z — y, so divides (y — x) + (z — y) = (z—x); hence x = z (mod n) . 
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The equivalence classes of this relation are the congruence classes modulo n. 
Sometimes we write the equivalence class of x modulo n as [x] n . Thus, 

M« = {• • • ,x — 2n,x — n,x,x + n, x + 2n, x + 3n, . . .}, 

an infinite set. The total number of equivalence classes is n; the classes are 
[0]„, [1]„, . . . , [n - \\ n (and then they repeat: [n\ n = [0]„). 

A representative of an equivalence class is just an element of the class. The 
system is completely egalitarian: anyone can be the class representative! As we 
have seen, if b G R(a), then R(a) = R(b). A semphset of representatives for R is a 
set of elements, one from each equivalence class. 

Sometimes there is a particularly nice way to choose the representatives, in 
which case they are called canonical. (This is not a mathematical term, since we 
have to decide what we mean by "nice"!) For example, the integers {0, 1, . . . ,n — 
1} form a canonical set of representatives for congruence modulo n. 

1.4 Matrices 

You should be familiar with matrices, and with the rules for adding and multiply- 
ing them. Usually, the entries of matrices are numbers, but we will take a more 
general view. Thus, if S is any set, then M mxn (S) means the set of all m x n matri- 
ces whose entries belong to 5; a matrix is just an rectangular array with elements 
of S in the positions. We denote the element in row i and column j of the matrix A 
by Aij. in the case of square matrices, with m = n, we simply write M n (S). 
The rules for matrix operations are: 

addition: if A and B are two m x n matrices, then A + B = C means that Qj = 
A^ + Bij for 1 < i < m and 1 < j < n. Note that this requires us to have 
a binary operation called "addition" on the set S. Note too that the two 
occurrences of + here have different meanings: A + B defines the operation 
of addition of matrices, while A i; is the given operation of addition on 
S. 

multiplication: suppose that A G M mxn (S) and B G M nxp (S), that is, the number 
of columns of A is the same as the number of rows of B. Then AB = D 
means that 

n 

for 1 < i < m, 1 < j < p. This is more complicated than addition. In order to 
be able to multiply two matrices, we need both addition and multiplication 
to be defined on the elements of S. Also, we must be able to add together n 
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elements of S, even though + is a binary operation. We will look further at 
how to do this later. 

Remember that there are conditions on the sizes of the matrices for these rules 
to work: we can only add two matrices if they have the same size; and we can 
only multiply two matrices if the number of columns of the first is equal to the 
number of rows of the second. In particular, for n x n matrices, both addition and 
multiplication are defined. 

Properties of addition and multiplication for matrices depend on properties of 
addition and multiplication for S. 

For example, let us prove the associative law for matrix multiplication. Sup- 
pose, for convenience, that A,5,C G Af„(R), so that we can use all the familiar 
properties of addition and multiplication of real numbers. Then 

j^{AB)ikC kj 

k=\ 

n n 

Y,AuBi k C k j, 

k=\l=\ 

n 

^A lk {BC) k] 

k=\ 

n n 

Y AikBkiC u- 

These two expressions differ only by swapping the names of the "dummy vari- 
ables" k and /, so are equal. 

If you are not entirely comfortable with dummy variables, write out (in the 
case n = 2 the four terms in each of the two sums (with i and j fixed, but k and / 
each taking the values 1 and 2) and check that the results are the same. 

1.5 Polynomials 

Again, for the moment, the coefficients of a polynomial will be numbers; but we 
will generalise this later. 

If you think that you know what a polynomial is, answer the following ques- 
tions: 

Is \x 2 + 0* + 2 the same polynomial as x 2 + 2? 

Is Ox 3 +x 2 + 2 the same polynomial as x 2 + 2? What is its degree? 

Is y 2 + 2 the same polynomial as x 2 + 2? 



{{AB)C)ij = 



(MBC))ij = 
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1 hope you answered "yes" to all these questions, and said that the degree is 2 in 
the second case. But these examples show that defining polynomials is going to 
be a bit complicated! So we defer this, and pretend for the moment that we know 
what a polynomial is. A first attempt at a definition would probably go: it is an 
expression of the form 

n 

£ a***, 

k=0 

where we can add or delete zero terms without changing the polynomial, and x h is 
short for \x h . The degree of the polynomial is the exponent of the largest power of 
x which has a non-zero coefficient. And finally, changing the name of the variable 
doesn't change the polynomial. 

The rules for addition and multiplication are: 

addition: if f(x) = Y* a k* k an d g(x) = Y.t>kX k , we can assume that both sums run 
from to n (by adding some zero terms to one polynomial if necessary); 
then 

n 

f(x)+g(x) = Y,(a k + b k )x k . 
k=0 

multiplication: 

n \ [ m \ n+m 

£ a k x k £ btJ = £ d k x k , 

k=0 J \k=0 J k=0 

with 

k 

d k = Y* a l b k-h 

1=0 

where any terms in this sum whose subscripts are outside the correct range 
are taken to be zero. 

If this seems complicated, the rules are simply capturing the usual way of 
adding and multiplying polynomials. There is nothing mysterious here! 

Addition and multiplication of real polynomials are commutative and associa- 
tive operations. 

1.6 Permutations 

LetX be any set. A permutation of X is a function g : X — > X which is one-to-one 
and onto, that is, a bijection from X to X. 

There are several common notations for a permutation of the set {1, . . . ,«}. 
We illustrate these with the permutation of {1,2,3,4,5,6} which maps 1 — > 3, 

2 -> 4, 3 -> 5, 4 -> 2, 5 -> 1 and 6 -> 6. 
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Two-line notation: We write the numbers 1, . . . , n in a row, and under each num- 
ber we put its image under the permutation. In the example, this gives 

(\ 2 3 4 5 6\ 
\3 4 5 2 1 6J' 

One-line notation: We write just the second line of the two-line form. In the 
example, this would be ( 3 4 5 2 1 6). 

Cycle notation: We take the first point of the set, and follow what happens to 
it as we apply the permutation repeatedly. Eventually we return to the 
starting point. When this happens, we write the points and its images in 
a bracket, representing the cycle. If not every point is included, we repeat 
with a new point and produce another cycle, until no points are left. A point 
which is fixed by the permutation (mapped to itself) lies in a cycle of size 1; 
sometimes we don't write such cycles. In our example, this would give 
(1,3, 5) (2, 4) (6), or just (1,3, 5) (2, 4) if we choose to omit the cycle (6). 

Let S n be the set of all permutations of the set { 1 , . . . , n}. We have 

\S n \ =n\ = n(n — l)(n — 2) • • • 1. 

For consider the two-line representation. The top row is (12 ... n). The bottom 
row consists of the same numbers in any order. Thus there are n possibilities for 
the first entry in the bottom row; n — 1 possibilities for the second (anything except 
the first), n — 2 possibilities for the third; and so on. 

Now we define an operation on permutations as follows. If g is a permutation, 
denote the image of the element x E { 1 , . . . , n} by xg. (Warning: we write the 
function on the right of its input. That is, xg, not g(x) as you might expect.) Now 
if g and h are two permutations, their composition g\g2 is defined by 

x(gh) = (xg)h for all x £ {1,. . . ,n}. 

In other words the rule is "apply g, then h". 

For example, if g is the permutation (1,3, 5) (2, 4) (6) in our above example, 
and h = (1,2,3,4,5,6), then gh = (1,4, 3, 6)(2, 5). You are strongly urged to 
practice composing permutations given in cycle form! 
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2.1 Introduction 

A ring can be thought of as a generalisation of the integers, Z. We can add and 
multiply elements of a ring, and we are interested in such questions as factorisation 
into primes, construction of "modular arithmetic", and so on. 

2.1.1 Definition of a ring 

Our first class of structures are rings. A ring has two operations: the first is 
called addition and is denoted by + (with infix notation); the second is called 
multiplication, and is usually denoted by juxtaposition (but sometimes by ■ with 
infix notation). 

In order to be a ring, the structure must satisfy certain rules called axioms. We 
group these into three classes. The name of the ring is R. 

We define a ring to be a set R with two binary operations satisfying the fol- 
lowing axioms: 

Axioms for addition: 

(AO) (Closure law) For any a,b E R, we have a + b E R. 

(Al) (Associative law) For any a,b,cE R, we have (a + b) + c = a + (b + c). 

(A2) (Identity law) There is an element E R with the property that a + = 
+ a = a for all a £ R. (The element is called the zero element of 
R.) 

(A3) (Inverse law) For any element a E R, there is an element b E R satis- 
fying a + b = b + a = 0. (We denote this element b by —a, and call it 
the additive inverse or negative of a.) 
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(A4) (Commutative law) For any a,b E R, we have a + b = b + a. 

Axioms for multiplication: 

(MO) (Closure law) For any a, b E R, we have ab E R. 

(Ml) (Associative law) For any a,b,c E R, we have (ab)c = a(bc). 

Mixed axiom: 

(D) (Distributive laws) For any a,b,c E R, we have (a + b)c = ac + bc and 
c(a + b) = ca + cb. 

Remarks 1 . The closure laws (AO) and (MO) are not strictly necessary. If + is 
a binary operation, then it is a function from R x R to R, and so certainly a + b is 
an element of R for all a,b E R. We keep these laws in our list as a reminder. 

2. The zero element defined by (A2) and the negative —a defined by (A3) 
are not claimed to be unique by the axioms. We will see later on that there is only 
one zero element in a ring, and that each element has only one negative. 

Axioms (MO) and (Ml) parallel (AO) and (Al). Notice that we do not require 
multiplicative analogues of the other additive axioms. But there will obviously be 
some rings in which they hold. We state them here for reference. 

Further multiplicative properties 

(M2) (Identity law) There is an element 1 E R such that al = la = a for all 
a E R. (The element 1 is called the identity element of R.) 

(M3) (Inverse law) For any a E R, if a ^ 0, then there exists an element b E R 
such that ab = ba= 1. (We denote this element b by a~ l , and call it 
the multiplicative inverse of a.) 

(M4) (Commutative law) For all a,b ER, we have ab — ba. 

A ring which satisfies (M2) is called a ring with identity; a ring which satisfies 
(M2) and (M3) is called a division ring; and a ring which satisfies (M4) is called 
a commutative ring. (Note that the term "commutative ring" refers to the fact that 
the multiplication is commutative; the addition in a ring is always commutative!) 
A ring which satisfies all three further properties (that is, a commutative division 
ring) is called a. field. 
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2.1.2 Examples of rings 

1. The integers 

The most important example of a ring is the set Z of integers, with the usual 
addition and multiplication. The various properties should be familiar to you; we 
will simply accept that they hold. Z is a commutative ring with identity. It is not a 
division ring because there is no integer b satisfying 2b = I. This ring will be our 
prototype for several things in the course. 

Note that the set N of natural numbers, or non-negative integers, is not a ring, 
since it fails the inverse law for addition. (There is no non-negative integer b such 
that2 + fr = 0.) 

2. Other number systems 

Several other familiar number systems, namely the rational numbers Q, the 
real numbers E, and the complex numbers C, are fields. Again, these properties 
are assumed to be familiar to you. 

3. The quaternions 

There do exist division rings in which the multiplication is not commutative, 
that is, which are not fields, but they are not so easy to find. The simplest example 
is the ring of quaternions, discovered by Hamilton in 1843. 

On 16 October 1843 (a Monday) Hamilton 
was walking in along the Royal Canal with 
his wife to preside at a Council meeting of 
the Royal Irish Academy. Although his wife 
talked to him now and again Hamilton 
hardly heard, for the discovery of the 
quaternions, the first noncommutative [ring] 
to be studied, was taking shape in his mind. 
He could not resist the impulse to carve the 
formulae for the quaternions in the stone of 
Broome Bridge (or Brougham Bridge as he 
called it) as he and his wife passed it. 

Instead of adding just one element i to the real numbers, Hamilton added three. 
That is, a quaternion is an object of the form a + bi + cj + dk, where 

i 2 =j 2 = k 2 = -l, ij = -ji = k. jk = — kj = i, ki = — ik = j. 

It can be shown that all the axioms (A0)-(A4), (M0)-(M3) and (D) are satisfied. 
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For example, if a, b, c, d are not all zero, then we have 

/ , • • „ \ ( a ~ bi - c) - dk \ 

(a + bi + cj + dk) —~ — - — - = 1. 

v ; \a 2 + b 2 + c 2 + d 2 J 

The ring of quaternions is denoted by H, to commemorate Hamilton. 

4. Matrix rings 

We briefly defined addition and multiplication for matrices in the last chapter. 
The formulae for addition and multiplication of nxn matrices, namely 

n 

(A + B) ij =A ij +B ij , (AB)ij = Y,A ik B kj , 

k=l 

just depend on the fact that we can add and multiply the entries. In principle 
these can be extended to any system in which addition and multiplication are 
possible. However, there is a problem with multiplication, because of the L£ =1 , 
which tells us to add up n terms. In general we can only add two things at a time, 
since addition is a binary operation, so we have to make the convention that, for 
example, a + b + c means (a + b)+c, a + b + c + d means (a + b + c) + d, and so 
on. We will return to this point in the next subsection. 
Now we have the following result: 

Proposition 2.1 Let R be a ring. Then the set M n (R) of n x n matrices over R, 
with addition and multiplication defined in the usual way, is a ring. If R has an 
identity, then M n (R) has an identity; but it is not in general a commutative ring or 
a division ring. 

We will look at the proof later, once we have considered addition of n terms. 

5. Polynomial rings 

In much the same way, the usual rules for addition of polynomials, 

(£a ( -y) + (£ bp?) = £(a ; - + bi)x\ (E^O = 

where 

i 

di = a kbi-k, 

k=0 

can be extended to polynomials with coefficients in any algebraic structure in 
which addition and multiplication are defined. As for matrices, we have to be able 
to add an arbitrary number of terms to make sense of the definition of multiplica- 
tion. We have the result: 
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Proposition 2.2 Let R be a ring, then the set R[x] of polynomials over R, with 
addition and multiplication defined in the usual way, is a ring. IfR is commutative, 
then so is R[x]; ifR has an identity, then so does R[x]; but it is not a division ring. 

Again we defer looking at the proof. 
6. Rings of sets 

The idea of forming a ring from operations 
on sets is due to George Boole, who 
published in 1854 An investigation into the 
Laws of Thought, on Which are founded the 
Mathematical Theories of Logic and 
Probabilities. Boole approached logic in a 
new way reducing it to algebra, in much the 
same way as Descartes had reduced 
geometry to algebra. 

The familiar set operations of union and intersection satisfy some but not all 
of the ring axioms. They are both commutative and associative, and satisfy the 
distributive laws both ways round; but they do not satisfy the identity and inverse 
laws for addition. 

Boole's algebra of sets works as follows. Let ^(A), the power set of A, be 
the set of all subsets of the set A. Now we define addition and multiplication on 
^(A) to be the operations of symmetric difference and intersection respectively: 

x + y = xAy, xy = xHy. 

Proposition 2.3 The set &(A), with the above operations, is a ring; it is commu- 
tative, has an identity element, but is not afield if\A\ > 1. It satisfies the further 
conditions x + x = and xx = xfor all x. 

We won't give a complete proof, but note that the empty set is the zero element 
(since xA% = x for any set x), while the additive inverse — x of x is equal to x 
itself (since x Ax = for any x). Check the other axioms for yourself with Venn 
diagrams. 

A ring satisfying the further condition that xx = x for all x is called a Boolean 
ring. 
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7. Zero rings 

Suppose that we have any set R with a binary operation + satisfying the ad- 
ditive axioms (A0)-(A4). (We will see later in the course that such a structure is 
called an abelian group.) Then we can make R into a ring by defining x y = for 
all x,y E R. This is not a very exciting rule for multiplication, but it is easy to 
check that all remaining axioms are satisfied. 

A ring in which all products are zero is called a zero ring. It is commutative, 
but doesn't have an identity (if \R\ > 1). 

8. Direct sum 

Let R and S be any two rings. Then we define the direct sum R © S as follows. 
As a set, R © S is just the cartesian product R x S. The operations are given by the 
rules 

(ri,5i) + (r 2 ,5 2 ) = {n + r2,si+s 2 ), {r\,s\)(r 2 ,s 2 ) = {r\r 2 ,s\s 2 ). 

(Note that in the ordered pair [r\ + r 2 ,s\ + s 2 ), the first + denotes addition in R, 
and the second + is addition in S.) 

Proposition 2.4 If R and S are rings, then RQ)S is a ring. IfR and S are commu- 
tative, then so is R © S; ifR and S have identities, then so does R © S; but RQ)S is 
not a division ring if both R and S have more than one element. 

The proof is straightforward checking. 

9. Modular arithmetic 

Let Z n denote the set of all congruence classes modulo n, where n is a positive 
integer. We saw in the first chapter that there are n congruence classes; so Z„ is a 
set with n elements: 



There is an important job to do here: we have to show that these definitions 
don't depend on our choice of representatives of the equivalence classes. 




[a]n + [b] n = [a + b] 



[a] n [b] n = [ab}„. 



Proposition 2.5 For any positive integer n, 7L n is a commutative ring with identity. 
It is afield if and only ifn is a prime number. 
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Here, for example, are the addition and multiplication tables of the ring Z5. 
We simplify the notation by writing x instead of [x]s. 
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Note, for example, that 2 = 3 in this ring. 

10. Rings of functions 

The sum and product of continuous real functions are continuous. So there is 
a ring C(R) of coninuous functions from R to R, with 

(/ + *)(*) =/(*) + *(*), (/*)(*)=/(*)*(*). 

There are several related rings, such as C 1 (R) (the ring of differentiable functions), 
Cb(R) (the ring of continuous functions satisfying /(x) — > as * — > ±°°), and 
C([a,fe]) (the ring of continuous functions on the interval [a,b]. All these rings are 
commutative, and all except Cb(R) have an identity (the constant function with 
value 1). 

These rings are the subject-matter of Functional Analysis. 

2.1.3 Properties of rings 

We have some business deferred from earlier to deal with. After that, we prove 
some basic properties of rings, starting from the axioms. 

Uniqueness of zero element 

The zero element of a ring is unique. For suppose that there are two zero 
elements, say z\ and zi- (This means that a + z\ = Z\ + a = a for all a and also 
a + Z2 = Z2 + a = a for all a.) Then 

Z\ = Z\ +Z2 = Z2- 



Exercise: Show that the identity element of a ring, if it exists, is unique. 
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Uniqueness of additive inverse 

The additive inverse of an element a is unique. For suppose that b and c are 
both additive inverses of a. (This means that a + b = b + a = and a + c = c + a = 
- we know now that there is a unique zero element, and we call it 0.) Then 

b = b + = b+(a + c) = (b + a) + c = + c = c, 

where we use the associative law in the third step. 

Exercise: Show that the multiplicative inverse of an element of a ring, if it 
exists, is unique. 

Adding more than two elements 

The associative law tells us that if we have to add three elements, then the two 
possible ways of doing it, namely (a + b) + c and a + (b + c), give us the same 
result. For more than three elements, there are many different ways of adding 
them: we have to put in brackets so that the sum can be worked out by adding two 
elements at a time. For example, there are five ways of adding four elements: 

((a + b) + c)+d, (a + (b + c))+d, (a + b) + (c + d), a + ((b + c)+d), a + (b+(c+d)). 

These are all equal. For the associative law (a + b) + c) = a + (b + c) shows that 
the first and second are equal, while the associative law for b, c, d shows that the 
fourth and fifth are equal. Also, putting x = a + b,we have 

((a + b) + c)+d= (x + c)+d =x + (c + d) = (a + b) + (c + d), 

so the first and third are equal; and similarly the third and fifth are equal. 

In general we have the following. The proof works for any associative binary 
operation. 

Proposition 2.6 Let * be an associative binary operation on a set A, and a\ , . . . ,a n e 
A. Then the result of evaluating a\ * a2 * ■ ■ ■ * a n , by adding brackets in any way to 
make the expression well-defined, is the same, independent of bracketing. 

Proof The proof is by induction on the number of terms. For n = 2 there is 
nothing to prove; for n = 3, the statement is just the associative law; and for n = 4, 
we showed it above. Suppose that the result is true for fewer than n terms. Suppose 
now that we have two different bracketings of the expression a^ * a2* ■ ■ ■ * a n . 
The first will have the form (a\ * • • • * a,-) * (a ;+ i * • • • * a n ), with the terms inside 
the two sets of brackets themselves bracketed in some way. By induction, the 
result is independent of the bracketing of a\ , . . . ,a, and of , . . . , a n . Similarly, 
the second expression will have the form {a\ * • • • * aj) * * • • • * a n ), and is 
independent of the bracketing of a\ , . . . ,aj and of aj + \ , . . . , a n . 
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Case 1 : i = j. Then the two expressions are obviously equal. 

Case 2 : i ^ j; suppose, without loss, that i < j. Then the first expression 
can be written as 

(ai * • • • * ai) * ((fl(+i * • • • * aj) * * • • • * a n )), 

and the second as 

((«!*■• • *a ; -) * (a i+ i * • • • *a/)) * * • • • *««), 

where each expression is independent of any further bracketing. By the associative 
law, these two expressions are equal: they are x * (y * z) and (x * y) * z, where 
x = a\ * ■ ■ ■ * at, y = a ( -+i * • • • * aj, and z = * • • • * a n . 

Note that this result applies to both addition and multiplication in a ring. 

n 

As usual, we denote a\ + H V a n by ^ 

(=l 

Cancellation laws 

Proposition 2.7 In a ring R, ifa+x = b+x, thena = b. Similarly, ifx + a = x + b, 
then a = b. 



Proof Suppose that a + x = b + x, and let y = —x. Then 

a = a + = a + (x + y) = (a + x)+y= (b + x) +y = b + (x + y) =b + = b. 
The other law is proved similarly, or by using the commutativity of addition. 
These facts are the cancellation laws. 

A property of zero 

One familiar property of the integers is that Oa = for any integer a. We don't 
have to include this as an axiom, since it follows from the other axioms. Here 
is the proof. We have + = 0, so 0a + = 0a = (0 + 0)a = 0a + 0a, by the 
distributive law; so the cancellation law gives = 0a. Similarly a0 = 0. 

It follows that if R has an identity 1, and \R\ > 1, then 1^0. For choose any 
element a ^ 0; then la = a and 0a = 0. It also explains why we have to exclude 
in condition (M3): cannot have a multiplicative inverse. 
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Commutativity of addition 

It turns out that, in a ring with identity, it is not necessary to assume that 
addition is commutative: axiom (A4) follows from the other ring axioms together 
with (M2). 

For suppose that (A0)-(A3), (M0)-(M2) and (D) all hold. We have to show 
that a + b = b + a. Consider the expression ( 1 + 1 ) (a + b). We can expand this in 
two different ways by the two distributive laws: 

(l + l)(a + b) = l(a + b) + l(a + b) =a + b + a + b, 
(l + l)(a + b) = (l + l)a + (l + l)b = a + a + b + b. 

Hence a + b + a + b = a + a + b + b, and using the two cancellation laws we 
conclude that b + a = a + b. 

This argument depends on the existence of a multiplicative identity. If we take 
a structure with an operation + satisfying (A0)-(A3) (we'll see later that such a 
structure is known as a group), and apply the "zero ring" construction to it (that 
is, ab = for all a,b), we obtain a structure satisfying all the ring axioms except 
(A4). 

Boolean rings 

We saw that a Boolean ring is a ring R in which xx = x for all x G R. 
Proposition 2.8 A Boolean ring is commutative and satisfies x + x = for all 

xeR. 

Proof We have (x + y)(x + y) = x + y. Expanding the left using the distributive 
laws, we find that 

xx + xy + yx + yy = x + y. 
Now xx = x and yy = y. So we can apply the cancellation laws to get 

xy + yx = 0. 

In particular, putting y = x in this equation, we have xx + xx = 0, or x + x = 0, one 
of the things we had to prove. 

Taking this equation and putting xy in place of x, we have 

xy + xy = = xy+yx, 

and then the cancellation law gives us xy = yx, as required. 

We saw that the power set of any set, with the operations of symmetric dif- 
ference and intersection, is a Boolean ring. Another example is the ring Z2 (the 
integers mod 2). 
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2.1.4 Matrix rings 

In view of Proposition 2.6, the definition of the product of two nxn matrices now 
makes sense: AB = D, where 

n 

Dij = ^A ik B k j. 

k=l 

So we are in the position to prove Proposition 2.1. 

A complete proof of this proposition involves verifying all the ring axioms. 
The arguments are somewhat repetitive; I will give proofs of two of the axioms. 

Axiom (A2): Let be the zero element of the ring R, and let O be the zero 
matrix in M n (R), satisfying Oy = for all Then O is the zero element of 
M n {R): for, given any matrix A, 

(0+A)ij = +A tj = + Aij = Atj, (A + 0)ij = Ay + O tj = Ay + = Ay, 

using the properties of e R. So O + A =A + = A. 
Axiom (D): the (i, j) entry of A(B + C) is 

n n 

£ A lk (B + C) kj = £ A ik B kj +A ik C kj , 

k=l k=\ 

by the distributive law in R; and the (i, j) entry of AB +AC is 

n n 

£ Afjfcfljy + £ A ik C kj . 
k=l k=\ 

Why are these two expressions the same? Let us consider the case n = 2. The first 
expression is 

AnBij +A n Cij +A i2 B 2j +A i2 C 2 j, 

while the second is 

AnBij +A l2 B 2j +A n C lj +A i2 C 2j . 

(By Proposition 2.6, the bracketing is not significant.) Now the commutative law 
for addition allows us to swap the second and third terms of the sum; so the two 
expressions are equal. Hence A(B + C) = AB +AC for any matrices A, 5, C. For 
n > 2, things are similar, but the rearrangement required is a bit more complicated. 
The proof of the other distributive law is similar. 

Observe what happens in this proof: we use properties of the ring R to deduce 
properties of M n (R). To prove the distributive law for M n (R), we needed the dis- 
tributive law and the associative and commutative laws for addition in R. Similar 
things happen for the other axioms. 
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2.1.5 Polynomial rings 

What exactly is a polynomial? We deferred this question before, but now is the 
time to face it. 

A polynomial Y. a i x ' is completely determined by the sequence of its coeffi- 
cients ciq, a\ , These have the property that only a finite number of terms in the 

sequence are non-zero, but we cannot say in advance how many. So we make the 
following definition: 

A polynomial over a ring R is an infinite sequence 

(«()(>0 = («0>«l>- • •) 

of elements of R, having the property that only finitely many terms are non-zero; 
that is, there exists an n such that at = for all i > n. If a n is the last non-zero 
term, we say that the degree of the polynomial is n. (Note that, according to this 
definition, the all- zero sequence does not have a degree.) 
Now the rules for addition and multiplication are 

(a f ) + (bj) = (ci) where q = a t + b u 

i 

(ai)(bi) = (di) where di = £ ajbi-j. 

j=o 

Again, the sum in the definition of multiplication is justified by Proposition 2.6. 
We think of the polynomial (a ( -)j>o of degree n as what we usually write as 

Y!i=o a i xl > me ru ^ es we S ave a g ree with the usual ones. 

Now we can prove Proposition 2.2, asserting that the set of polynomials over a 
ring R is a ring. As for matrices, we have to check all the axioms, which involves 
a certain amount of tedium. The zero polynomial required by (A2) is the all-zero 
sequence. Here is a proof of (Ml). You will see that it involves careful work with 
dummy subscripts! 

We have to prove the associative law for multiplication. So suppose that / = 
(cii), g = (bi) and h = (ci). Then the ith term of fg is Y!j=o a jbi-j, and so the ?'th 
term of ( fg)h is 

E (E^h) c i-k- 

k=0 \j=0 J 

Similarly the ?'th term of f(gh) is 

i ( i—s 

E a s E b t c is-t 
s=Q \t=0 

Each term on both sides has the form a p b q c r , where p,q,r>Q and p + q + r = i. 
(In the first expression, p = j, q = k — j, r = i — k; in the second, p = s, q = t, 
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r = i — s — t.) So the two expressions contain the same terms in a different order. 
By the associative and commutative laws for addition, they are equal. 

2.2 Subrings 

2.2.1 Definition and test 

Suppose that we are given a set S with operations of addition and multiplication, 
and we are asked to prove that it is a ring. In general, we have to check all the 
axioms. But there is a situation in which things are much simpler: this is when 
S is a subset of a set R which we already know to be a ring, and the addition and 
multiplication in S are just the restrictions of the operations in R (that is, to add 
two elements of 5, we regard them as elements of R and use the addition in R). 

Definition Let R be a ring. A subring of R is a subset S of R which is a ring in 
its own right with respect to the restrictions of the operations in R. 
What do we have to do to show that S is a subring? 

• The associative law (Al) holds in S. For, if a,b,c E S, then we have a,b,c E 
R (since S C R), and so 

(a + b) + c = a + (b + c) 
since R satisfies (Al) (as we are given that it is a ring). 

• Exactly the same argument shows that the commutative law for addition 
(A4), the associative law for multiplication (Ml), and the distributive laws 
(D), all hold in S. 

• This leaves only (AO), (A2), (A3) and (MO) to check. 

Even here we can make a simplification, if S ^ 0. For suppose that (AO) and 
(A3) hold in S. Given a E S, the additive inverse —a belongs to S (since we are 
assuming (A3)), and so = a + (—a) belongs to S (since we are assuming (AO)). 
Thus (A2) follows from (AO) and (A3). 
We state this as a theorem: 

Theorem 2.9 (First Subring Test) Let R be a ring, and let S be a non-empty sub- 
set ofR. Then S is a subring ofR if the following condition holds: 

for all a,b E S, we have a + b, ab, —a E S. 
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Example We show that the set S of even integers is a ring. Clearly it is a non- 
empty subset of the ring Z of integers. Now, if a,b G S, say a = 2c and b = 2d, 
we have 

a + b = 2(c + d) eS, ab = 2(2cd) G 5, -a = 2(-c) G S, 

and so S is a subring of Z, and hence is a ring. 

The theorem gives us three things to check. But we can reduce the number 
from three to two. We use a — b as shorthand for a + (—b). In the next proof we 
need to know that — (— b) = b. This holds for the following reason. We have, by 
(A3), 

b + (-b) = (-b)+b = 0, 

so that b is an additive inverse of —b. Also, of course, —(—b) is an additive 
inverse of —b. By the uniqueness of additive inverse, —(—b) = b, as required. In 
particular, a — (—b) = a + (— (— b)) = a + b. 

Theorem 2.10 (Second Subring Test) Let R be a ring, and let S be a non-empty 
subset ofR. Then S is a subring ofR if the following condition holds: 

for all a,b G S, we have a — b,ab<ES. 

Proof Let S satisfy this condition: that is, S is closed under subtraction and mul- 
tiplication. We have to verify that it satisfies the conditions of the First Subring 
Test. Choose any element a G S (this is possible since S is non-empty). Then 
the hypothesis of the theorem shows that = a — a G S. Applying the hypothesis 
again shows that — a = — a G S. Finally, if a, b G S, then —beS (by what has 
just been proved), and so a + b = a — (—b) G S. So we are done. 

2.2.2 Cosets 

Suppose that S is a subring of R. We now define a partition of R, one of whose 
parts is S. Remember that, by the Equivalence Relation Theorem, in order to 
specify a partition of R, we must give an equivalence relation on R. 
Let =5 be the relation on R defined by the rule 

a=sb if and only if b — aeS. 

We claim that =5 is an equivalence relation. 

Reflexive: for any a G R, a — a = G S, so a =5 a. 

Symmetric: take a.beR with a =5 b, so that b — aeS. Then a — b = —(b — a) G 
S, so b =5 a. 
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Transitive: take a,b,c E R with a =5 b and & =5 c. Then b — a,c — b E S. So 
c — a = (c — + (b — a) E S, so a =5 c. 

So =5 is an equivalence relation. Its equivalence classes are called the cosets 
of S in R. 

Example Let n be a positive intger. Let R = Z and 5 = «Z, the set of all mul- 
tiples of n. Then 5 is a subring of R. (By the Second Subring Test, if a,b E S, 
say a — nc and b = nd, then a — b = n(c — d)ES and a& = n(ncd) E S.) In this 
case, the relation =5 is just congruence mod n, since a =5 b if and only if £ — a is 
a multiple of «. The cosets of S are thus precisely the congruence classes mod n. 

An element of a coset is called a coset representative. As we saw in the first 
chapter, it is a general property of equivalence relations that any element can be 
used as the coset representative: if b is in the same equivalence class as a, then a 
and b define the same equivalence classes. We now give a description of cosets. 

If 5 is a subset of R, and a E R,we define S + a to be the set 

S + a = {s + a : s E S} 
consisting of all elements that we can get by adding a to an element of S. 

Proposition 2.11 Let S be a subring ofR, and aER. Then the coset ofR contain- 
ing a is S + a. 

Proof Let [a] denote the coset containing a, that is, 

[a] = {bER:a= s b} = {bER:b-aES}. 

We have to show that [a] = S + a. 

First take b E [a], so that b — aES. Let s = b — a. Then b = s + a E S + a. 

In the other direction, take b E S + a, so that b = s + a for some s E S. Then 
b — a = (s + a) — a = s E S, so b =5 a, that is, b E [a]. 

So [a] = S + a, as required. 

Any element of a coset can be used as its representative. That is, if b E S + a, 
thenS + a = S + b. 
Here is a picture. 
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Note that 5 + = 5, so the subring 5 is a coset of itself, namely the coset 
containing 0. 

In particular, the congruence class [a] n in Z is the coset nZ + a, consisting of 
all elements obtained by adding a multiple of n to a. So the ring Z is partitioned 
into n cosets of riL. 



2.3 Homomorphisms and quotient rings 
2.3.1 Isomorphism 

Here are the addition and multiplication tables of a ring with two elements, which 
for now I will call o and /. 
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o 


i 




o 


i 


o 


o 


i 


o 


o 


o 


i 


i 


o 


i 


o 


i 



You may recognise this ring in various guises: it is the Boolean ring £P(X), where 
X = {x} is a set with just one element x; we have o = and i = {x}. Alternatively 
it is the ring of integers mod 2, with o = [0]2 and i = [1]2. 

The fact that these two rings have the same addition and multiplication tables 
shows that, from an algebraic point of view, we cannot distinguish between them. 

We formalise this as follows. Let R\ and R2 be rings. Let 6 : R\ — > R2 be 
a function which is one-to-one and onto, that is, a bijection between R\ and R2. 
Now we denote the result of applying the function to an element r e Ri by rO or 
(r) 6 rather than by 6 (r) ; that is, we write the function on the right of its argument. 

Now we say that is an isomorphism from R[ to R2 if it is a bijection which 
satisfies 

(r 1 +r 2 )e = r 1 + r 2 0, (nn)6 = ( ri 6)fa6). (2.1) 

This means that we "match up" elements in R\ with elements in R2 so that addi- 
tion and multiplication work in the same way in both rings. 
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Example To return to our earlier example, let R\ = £P({x}) and let R2 be the 
ring of integers mod 2, and define a function : R\ — > R2 by 



Then is an isomorphism. 

We say that the rings R\ and R2 are "isomorphic" if there is an isomorphism 
from Ri to R2. The word "isomorphic" means, roughly speaking, "the same 
shape": if two rings are isomorphic then they can be regarded as identical from 
the point of view of Ring Theory, even if their actual elements are quite different 
(as in our example). We could say that Ring Theory is the study of properties of 
rings which are the same in isomorphic rings. 

So, for example, if R\ and R2 are isomorphic then: 

• If R\ is commutative, then so is R2, and vice versa; and the same holds for 
the property of being a ring with identity, a division ring, a Boolean ring, a 
zero ring, etc. 

• However, the property of being a ring of matrices, or a ring of polynomials, 
etc., are not necessarily shared by isomorphic rings. 

We use the notation Ri = R2 to mean is isomorphic to R2". Remember 
that isomorphism is a relation between two rings. If you are given two rings Ri 
and R2 and asked whether they are isomorphic, do not say is isomorphic but 
R 2 is not". 

2.3.2 Homomorphisms 

An isomorphism is a function between rings with two properties: it is a bijection 
(one-to-one and onto), and it preserves addition and multiplication (as expressed 
by equation (2.1)). A function which preserves addition and multiplication but 
is not necessarily a bijection is called a homomorphism. Thus, a homomorphism 
from R\ to R2 is a function : R\ — > R2 satisfying 



You should get used to these two long words, and two others. A function : R\ 
R 2 is 

• a homomorphism if it satisfies (2.1); (homo=similar) 

• a monomorphism if it satisfies (2.1) and is one-to-one; (mono=one) 

• an epimorphism if it satisfies (2.1) and is onto; (epi=onto) 



©0 - [0] 2 , 



{x}6 = [l} 2 . 



(r l +r 2 )0 = n6 + r 2 e, 



(nr 2 )0 = (n0)(r 2 0). 
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• an isomorphism if it satisfies (2.1) and is one-to-one and onto (iso=equal) 

For example, the function from the ring Z to the ring of integers mod 2, which 
takes the integer n to its congruence class [n\2 mod 2, is a homomorphism. Basi- 
cally this says that, if we only care about the parity of an integer, its congruence 
mod 2, then the addition and multiplication tables are 



+ 


even 


odd 




even 


odd 


even 


even 


odd 


even 


even 


even 


odd 


odd 


even 


odd 


even 


odd 



and this ring is the same as the one at the start of this section. 

Let : R\ — > R2 be a homomorphism. The image of is, as usual, the set 

lm(0) = {s e R2 : s = rd for some r e R\}. 

We define the kernel of to be the set 

Ker(0) = {reRi :r0=O}, 

the set of elements of R\ which are mapped to the zero element of R2 by 0. You 
will have seen a definition very similar to this in Linear Algebra. 

The image and kernel of a homomorphism have an extra property. This is not 
the final version of this theorem: we will strengthen it in two ways in the next two 
sections. First, a lemma: 

Lemma 2.12 Let : Ri — > R2 be a homomorphism. Then 

(a) 00 = 0; 

(b) (—a) 6 = -(ad) for all a e Ri; 

(c) (a-b)Q = ad -bO for all a,b eR\. 
Proof We have 

+ 00 =00 = (0 + 0)0 =00 + 00, 

and the cancellation law gives 00 = 0. 
Then 

a0 + (-a)0 = (a-a)0 =00 = 0, 

so (— a)6 is the additive inverse of a6, that is, (—1)0 = —(ad). 
Finally, (a-b)0 = aO + (-b)0 = a6 - bO. 

Proposition 2.13 Let : R\ — * R2 be a homomorphism. Then 
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(a) lm(0) is a subring ofR2,' 

(b) Ker(0) is a subring ofR\. 

Proof We use the Second Subring Test. 

(a) First notice that lm(0) 7^ 0, since lm(0) contains 0, by the Lemma. 
Take a,b G lm(0), say, a = xO and b = yO. Then —b = (—y)0, so 



Also ab = (x0)(y0) = (xy)6 G lm(0). So lm(0) is a subring of R 2 . 

(b) First notice that Ker(0) 7^ 0, since Ker(0) contains 0, by the Lemma. 
Take a,b G Ker(0), so that a6 = bG = 0. Then 



An ideal in a ring is a special kind of subring. 

Let S be a subring of R. We say that S is an ideal if, for any a G S and r G R, 
we have ar G S and ra G 5. 

For example, let R = Z and S = nZ for some positive integer n. We know that 
S is a subring of R. Choose a G 5, say a = nc for some c £ Z. Then ar = ra = 
n(cr) G 5. So 5 is an ideal. 

Any ring 7? has two trivial ideals: the whole ring i? is an ideal; and the set {0} 
consisting only of the zero element is an ideal. 

There is an ideal test similar to the subring tests. We give just one form. 

Theorem 2.14 (Ideal Test) Let R be a ring, and S a non-empty subset ofR. Then 
S is an ideal if the following conditions hold: 

(a) for all a,b G S, we have a — bES; 

(b) for all a G S and r G R, we have ar, ra G S. 



a ^b = x6 + (-y)6 = (x-y)6 Glm(0). 



(a-b)8 
(ab)6 



ad -b0 = 0-0 = 0, 
(ad)(bG) =0-0 = 0, 



so Ker(0) is a subring. 



2.3.3 Ideals 



Proof Take a,b G S. Then ab G S (this is a special case of (b), with r = b). So by 
the Second Subring Test, S is a subring. Then by (b), it is an ideal. 
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Now we can strengthen the statement that the kernel of a homomorphism is a 
subring. 

Proposition 2.15 Let d : R\ — > R2 be a homomorphism. Then Ker(0) is an ideal 
in R\. 

Proof We aheady know that it is a subring, so we only have to check the last part 
of the definition. So take a E Ker(0) (so that ad = 0), and r ER\. Then 

{ar)6 = (aO)(rO) = 0(rO) = 0, 

and similarly (ra)9 = 0. So ar,ra E Ker(0). 

We will see in the next section that it goes the other way too: every ideal is 
the kernel of a homomorphism. So "ideals" are the same thing as "kernels of 
homomorphisms". 

2.3.4 Quotient rings 

Let / be an ideal of a ring R. We will define a ring, which we call the quotient ring 
or factor ring, of R by /, and denote by R/I. 

The elements of R/I are the cosets of / in R. Thus each element of R/I is a set 
of elements (an equivalence class) of R. Remember that each coset can be written 
as / + a for some a E R. Now we have to define addition and multiplication. We 
do this by the rules 

(I + a) + (I + b) = I+(a + b), 
(I + a)(I + b) = I + ab. 

There is one important job that we have to do to prove that this is a good 
definition. Remember that any element of a coset can be used as a representative. 
So you might use the representatives a and b, while I use the representatives a' 
and b' for the same cosets. We need to show that the definitions don't depend on 
these choices; that is, we have to show that 

I+ a = I+a' andl+b = l+b' imply I+(a + b) =I+(a'+b') and I+ab = I +a'b' . 

So suppose that I + a = I + a' and I + b = I + b'. Then a' E / + a, so a' = s + a 
for some s El. Similarly, b' = t + b for some t El. Now 



a' + b' = (s + a) + (t + b) = (s + t) + (a + b) E I + (a + b), 
ab' = (s + a)(t + b) = st + sb + ta + ab E I + ab, 
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by using the associative and commutative laws for addition and the distributive 
laws. So the result is proved, once we justify the last step by showing that s + t El 
and st + sb + at E I. Remember that s,t E I, so that s + 1 E I (as / is a subring); 
also st E I (since / is a subring) and sb El and at E I (since / is an ideal), so the 
sum of these three expressions is in /. 

Proposition 2.16 If I is an ideal of the ring R, then the setR/I, with operations of 
addition and multiplication defined as above, is a ring, and the map 6 : R — > R/I 
defined by rO = I + r is a homomorphism whose kernel is I. 

Proof We have well-defined operations of addition and multiplication, so (AO) 
and (MO) hold. The proofs of the other axioms are all very similar. Here is a proof 
of the first distributive law. Take three elements of R/I (that is, three cosets!), say 
I + a,I + b,I + c. Then 

((/ + «) + (I + b))(I + c) = (I+(a + b))(I + c) 

= I+(a + b)c 

= I+(ac + bc) 

= (I + ac) + (I + bc) 

= (I + a)(I + c) + (I + b)(I + c). 

Here we use the distributive law in R to get from the second line to the third, while 
the other steps just use the definitions of addition and multiplication in R/I. 
Next we show that is a homomorphism. This is true by definition: 

(a + b)6 = (I + a) + (I + b) = I + (a + b) = (a + b)6, 
(ab)6 = (I + a)(I + b) = I + (ab) = (ab)6. 

Finally we calculate Ker(0). There is one important thing to note. The zero 
element of R/I is the coset 1 + 0. This is just the ideal / itself! So 

Ker(e) = {a E R : ad = 0} = {a E R : I + a = 1} = I, 

since I + a = I means that a is a representative for the coset /, that is, a El. 

The map 9 in this result is called the natural homomorphism from R to R/I. 
We see that, if / is any ideal of R, then / is the kernel of the natural homomorphism 
from R to R/I. 
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2.3.5 The Isomorphism Theorems 

The Isomorphism Theorems are a number of results which look more closely at 
a homomorphism. The first one makes more precise the results we saw earlier 
about the image and kernel of a homomorphism. 

Theorem 2.17 (First Isomorphism Theorem) Let R\ and R2 be rings, and let 
G : R\ — > R2 be a homomorphism. Then 

(a) lm(0) is a subring ofR%; 

(b) Ker(0) is an ideal ofR\; 

(c) J?i/Ker(0) ^lm(0). 

Proof We already proved the first two parts of this theorem, in Propositions 
2.13 and 2.15. We have to prove (c). Remember that this means that the rings 
i?i/Ker(0) (the quotient ring, which is defined because Ker(0) is an ideal in R{) 
and lm(0) (a subring of R2) are isomorphic. We have to construct a map be- 
tween these two rings which is one-to-one and onto, and is a homomorphism. 
Put 7 = Ker(0), and define <j) by the rule 

(7+r)0 = r0 

for r E R\. On the face of it, this might depend on the choice of the coset rep- 
resentative r. So first we have to prove that, if 7 + r = I + r', then r6 = r'6. We 
have 

I + r = I + r =>• r' = s + r for some sG 7 = Ker(0) 
r'6 =s6+r6 =0 + r6 = r6, 

as required. So indeed is well defined. 

In fact this argument also reverses. If r6 = r'6, then (r' — r)9 = r'6 — r6 = 0, 
so r' — r E Ker(0). This means, by definition, that r and r' lie in the same coset of 
Ker(0) = I, so that / + r = 7 + r 1 . This shows that is one-to-one. 

To show that is onto, take s E lm(0). Then s = r9 for some r E R, and we 
have s = rO = (I + r)<j>. So lm(0) = lm(0) as required. 

Finally, 

((/ + r 1 ) + (/ + r 2 ))0 = (r 1 + r 2 )0 = ( n 0) + (r 2 0) = (7+ n )<j> + (7 + r 2 )0, 
((I + n)(I + r 2 ))(}) = (nr2)e = (n0)(r 2 0) = (7 + n)0(7 + r 2 )0, 



so is a homomorphism, and hence an isomorphism, as required. 
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Figure 2.1: A homomorphism 
We illustrate this theorem with a picture. 

In the picture, the parts into which Ri is divided are the cosets of the ideal 
ker(0) (the set Ker(0) itself has been taken to be the top part of the partition). 
The oval region inside R2 is the subring lm(0). Each coset of Ker(0) maps to a 
single element of lm(6). 

The second Isomorphism Theorem is sometimes called the "Correspondence 
Theorem", since it says that subrings of R/I correspond in a one-to-one manner 
with subrings of R containing /. 

Theorem 2.18 (Second Isomorphism Theorem) Let I be an ideal of the ring R. 
Then there is a one-to-one correspondence between the subrings of R/I and the 
subrings of R containing I, given as follows: to a subring S of R containing I 
corresponds the subring S/I of R/I. Under this correspondence, ideals of R/I 
correspond to ideals ofR containing I; and, if J is an ideal ofR containing I, then 

(R/I) / (J /I) 91 R/ J. 

Proof If S is a subring of R containing /, then / is an ideal of S. (For applying 
the ideal test inside S means we have to check that / is closed under subtraction 
and under multiplication by elements of 5; these are just some of the checks that 
would be required to show that it is an ideal of R. Now if s E S, then the entire 
coset I + s lies in S, since S is closed under addition. So S/I is well-defined: it 
consists of all the cosets of / which are contained in S. Clearly it is a subring of 
R/I. Thus, we have a mapping from subrings of R containing / to subrings of R/I. 

In the other direction, let T be a subring of R/I. This means that T is a set of 
cosets of / which form a ring. Let S be the union of all the cosets in T . We will 
show that S is a subring of R. It obviously contains / (since / is the zero coset) and 
S/I = T follows. 
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Take a,b E S. Then I + aJ + b ET. Since T is a subring, we have (7 + a) — 
(I + b) =I+(a-b) ET and + (I + b) = I + ab E T, so a- b E S and ab E S. 
By the Second Subring Test, S is a subring. 

Next we show that ideals correspond to ideals. Let J be an ideal of R con- 
taining I. Then J /I is a subring of R/I, and we have to show that it is an ideal. 
Take I + a E J/I and I + r E R/I. Then a E J and r E R, so ar, ra E J, whence 
(I + a)(I + r),(I+r)(I + a) E J/I. Thus J/I is an ideal of R/I. The converse is 
similar. 

I will not give the proof that (R/I) /(J /I) = R/J: this will not be used in the 
course. 

The Third Isomorphism Theorem needs a little more notation. Let A and B be 
two subsets of a ring R. Then we define A + 5 to consist of all sums of an element 
of A and an element of B : 

A+B = {a + b : a E A,b E B}. 

Theorem 2.19 (Third Isomorphism Theorem) Let R be a ring, S a subring of 
R, and I an ideal ofR. Then 

(a) S + 1 is a subring ofR containing I; 

(b) SHI is an ideal ofS; 

(c) s/(sni)^(s+i)/i. 

Proof We could prove the three parts in order, but it is actually easier to start 
at the end! Remember the natural homomorphism 6 from R to R/I with kernel 
0. What happens when we restrict to S, that is, we only put elements of S into 
the function 0? Let <j> denote this restriction. Then maps S to R/I. We find its 
image and kernel, and apply the First Isomorphism Theorem to them. 

(a) The image of consists of all cosets I + s containing a coset representative 
in S. The union of all these cosets is I + S, so the image of is (I + S)/I. 
This is a subring of R/I (since it is the image of a homomorphism). By the 
Correspondence Theorem, S + 1 is a subring of R containing I. 

(b) The kernel of consists of all elements of S mapped to zero by 0, that is, 
all elements s E S such that s E Ker(0) = I. Thus, Ker(0) = SHI, and so 
S fl I is an ideal of S. 

(c) Now the first isomorphism theorem shows that 

S/(I + S)^Im((j>) = (I + S)/I, 

and we are done. 
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2.4 Factorisation 

One of the most important properties of the integers is that any number can be 
factorised into prime factors in a unique way. But we have to be a bit careful. It 
would be silly to try to factorise or 1 ; and the factorisation is not quite unique, 
since (—2) ■ (—3) = 2-3, for example. Once we have the definitions straight, we 
will see that "unique factorisation" holds in a large class of rings. 

2.4.1 Zero divisors and units 

In this section, we will assume that our rings are always commutative. 

Let R be a ring. We know that Oa = holds for all a E R. It is also possible 
for the product of two non-zero elements of R to be zero. We say that a is a 

zero-divisor if 

• a 7^ 0, and 

• there exists b E R, with b ^ 0, such that ab = 0. 

In other words, if the product of two non-zero elements is zero, then we call each 
of them a zero-divisor. 

The ring Z has no zero-divisors, since if a and b are non-zero integers then 
obviously ab ^ 0. Also, a field has no zero divisors. For suppose that R is a field, 
and let a be a zero-divisor. Thus, a ^ 0, and there exists b ^ such that ab = 0. 
Since R is a field, a has a multiplicative inverse aT x satisfying aT x a = 1. Then 

= a -1 = a-\ab) = (a^ l a)b = \b = b, 

contradicting our assumption that b ^ 0. 

In the next example, we use the greatest common divisor function for integers: 
J is a greatest common divisor of a and b if it divides both of them, and if any 
other divisor of a and b also divides d. That is, 6 is a greatest common divisor 
of 12 and 18; but —6 is also a greatest common divisor. We will live with this 
slight awkwardness for a while, choosing gcd(a,&) to be the positive rather than 
the negative value. 

Example Let R = Z/nZ, the ring of integers mod n. Then the element a E R is 
a zero-divisor if and only if 1 < gcd(a,n) < n. 

Proof Suppose that a is a zero-divisor in R. This means that a ^ in R (that is, 
a is not divisible by n, which shows that gcd(a,n) < n), and there exists b E R 
with b 7^ and ab = 0. So, regarding a,b,n as integers, we have n \ ab but n 
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doesn't divide either a or b. We are trying to prove that gcd(a, n) > 1, so suppose 
(for a contradiction) that the greatest common divisor is 1. Since n and a are 
coprime, the fact that n divides ab means that n must divide b, which contradicts 
our assumption that b ^ in R. 

Conversely, suppose that 1 < d = gcd(a,n) < n. Then a ^ as an element of 
R. Let a = dx and n = db. Then n divides nx = (db)x = (dx)b = ab, but clearly 
n doesn't divide y. So, in the ring R, we have ab = and b ^ 0. Thus a is a 
zero-divisor. 

From now on we make another assumption about our rings: as well as being 
commutative, they will always have an identity element. We make a definition: 

An integral domain is a commutative ring with identity which has no zero- 
divisors. 

Example Z is an integral domain. (This example is the "prototype" of an inte- 
gral domain, and gives us the name for this class of rings.) Any field is an integral 
domain. The ring Z/nZ is an integral domain if and only if n is a prime number. 

The last statement is true because a positive integer n has the property that 
every smaller positive integer a satisfies gcd(a,n) = 1 if and only if n is prime. 

Example If R is an integral domain, then so is the ring R[x] of polynomials over 
R. 

For suppose that / and g are non-zero polynomials, with degrees m and n 
respectively: that is, 

n m 

f(x) = £ a t x\ g(x) = £ b t x\ 

i=0 i=0 

where a n ^ and b m ^ 0. The coefficient of x m+n in f(x)g(x) is a n b m ^ (because 
R is an integral domain). So f(x)g(x) ^ 0. 

Let R be a ring with identity element 1; we assume that 1^0, Let a E R, with 
a 7^ 0. An inverse of a is an element b E R such that ab = ba = \. We say that a us 
a unit if it has an inverse. (We exclude zero because obviously has no inverse: 
0b = for any element b.) 

An element a has at most one inverse. For suppose that b and c are inverses of 
a. Then 

b = bl = b(ac) = (ba)c = ac = c. 

We write the inverse of the unit a as aT l . Furthermore, a zero-divisor cannot be a 
unit. For, \iba=\ and ac = 0, then 



= bO = b(ac) = (ba)c = lc = c. 
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Lemma 2.20 Let R be a ring with identity. Then 

(a) I is a unit; 

(b) ifu is a unit then so is u~ l ; 

(c) ifu and v are units then so is uv. 

Proof (a) 1-1 = 1. 

(b) The equations uu~ l = u~ 1 u= 1 show that the inverse of u~ l is u. 

(c) Let u and v be units. We claim that the inverse of uv is v _1 w _1 . (Note the 
reverse order!) For we have 



To help you remember that you have to reverse the order when you find the 
inverse of a product, this example may help. Suppose that u is the operation of 
putting on your socks, and v the operation of putting on your shoes, so that uv 
means "put on your socks and then your shoes". What is the inverse of uvl 

Example In the integral domain Z, the only units are +1 and —1. For if ab = 1, 
then a = I or a = —I. 

Example Consider the ring Z/nZ, where n > 1. We already saw that a is a 
zero-divisor if and only if 1 < gcd(a,n) < n. We claim that a is a unit if and only 
if gcd(a,n) = 1. 

Suppose first that a is a unit, and that d = gcd(a,n). Then d \ a and d \ n. Let 
b be the inverse of a, so that ab = 1 in R, which means that ab = 1 (mod n), or 
ab = xn + 1 . But then d divides ab and d divides xn, so d divides 1 , whence d = 1 . 

To prove the converse, we use the Euclidean algorithm (more about this shortly), 
which shows that, given any two integers a and n, there are integers x and y such 
that xa+yn = d, where d = gcd(a,n). If d = 1, then this equation shows that 
xa = 1 (mod n), so that xa = 1 in Z/nZ, so that a is a unit. 

This shows that every non-zero element oiZ/nZ is either a zero-divisor or a 
unit. 



(uv)(v u ) 
( V - l u- l )(uv) 



u{yv )u =u\u = uu = 1, 
v _1 (u~ 1 u)v = v _1 lv = v _1 v = 1. 
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For example, for n = 12, we have: 
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zero-divisor 
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zero-divisor 


10 
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We call two elements a,b E R associates if there is a unit u E R such that 
b = ua. Write a ~ b to mean that a and b are associates. Thus, any unit is an 
associate of 1, while is associate only to itself. 

Being associates is an equivalence relation: it is 

• reflexive since a = al and 1 is a unit; 

• symmetric since, if b = au, then a = bu~ l , and iC x is a unit; 

• transitive since, if b = au and c = bv where u and v are units, then c = a(uv), 
and uv is a unit. 

Here we have invoked the three parts of the lemma above about units. 
For example, in the ring Z/12Z, the associate classes are 

{0}, {1,5,7,11}, {2,10}, {3,9} {4,8} {6}. 

For example, the associate class containing 2 consists of 2, 2 • 5 = 10, 2 • 7 = 2, 
and 2- 11 = 10. 

Now we can define greatest common divisors properly. 

Let R be an integral domain. (Remember: this means that R is a commutative 
ring with identity and has no divisors of zero.) We say that a divides bmR (written 
as usual as a \ b) if there exists xER with b = ax. Notice that every element divides 
0, whereas doesn't divide anything else except 0. Also, 1 divides any element 
of R, but the only elements which divide 1 are the units of R. [Check all these 
claims!] 

Proposition 2.21 In an integral domain R, two elements a and b are associates if 
and only if a\b and b \ a. 
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Proof Suppose that a and b are associates. Then b = au for some unit u, so a \ b. 
Also a = bu~ l , so b | a. 

Conversely, suppose that a \ b and b\ a. If a = 0, then also = and a,b 
are associates. So suppose that a^O. Then there are elements x and y such that 
b = ax and a = fry. We have axy = a, so a(\ — xy) =0. Since R is an integral 
domain and a ^ 0, we must have 1 — xy = 0, or xy = 1 . So x and y axe units, and 
a and b are associates. 

Now we say that J is a greatest common divisor of a and b if 

• J | a and J | 

• if e is any element such that e | a and e | then e | fr 

We abbreviate "greatest common divisor" to gcd. 

Notice that, in general, "greatest" does not mean "largest" in any obvious way. 
Both 6 and —6 are greatest common divisors of 12 and 18 in Z, for example. 

Proposition 2.22 If d is a gcd of two elements a,b in an integral domain R, then 
another element d' is a gcd of a and b if and only if it is an associate of d. 

Proof Suppose first that d and d' are both geds of a and b. Then d' \ d and d \ d' 
(using the second part of the definition), so that d and d' are associates. 

Conversely, suppose that d is a gcd of a and b (say a = dx and b = dy), and d' 
an associate of d, say d' = du for some unit u. Then 

• a = d'iC x x and b = d'uT l y, so d' \ a and d' \ b; 

• suppose that e \ a and e \ b. Then e \ d, say d = ez; so we have d' = euT l z 
and e \ d' . 

Thus d' is a gcd of a and b. 

Thus we can say: the greatest common divisor of a and b, if it exists, is "unique 
up to associate", that is, any two geds are associates. We use the notation gcd(a, b) 
to denote some (unspecified) greatest common divisor. In the integers, we can 
make the convention that we choose the non-negative element of the associate 
pair as the gcd. 
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2.4.2 Unique factorisation domains 

We are interested in the property of "unique factorisation" of integers, that is, any 
integer other than 0, + 1 , — 1 can be uniquely factorised into primes. Of course, the 
factorisation is not quite unique, for two reasons: 

(a) the multiplication is commutative, so we can change the order: 6 = 2-3 = 
3-2. 

(b) we will see that —2 and —3 also count as "primes", and 6 = 2-3 = (—2) • 
(-3). 

By convention, 1 is not a prime, since it divides everything. The same holds for 
— 1 (and only these two integers, since they are the only units in Z.) Accordingly, 
we will specify that irreducible elements (the analogue of primes in a general 
domain) should not be zero or units, and that we only try to factorise elements 
which are not zero or a unit. 

So we make the following definitions. Let R be an integral domain. 

(a) An element p e R is irreducible if p is not zero or a unit, but whenever 
p = ab, then one of a and b is a unit (and the other therefore an associate of 

P). 

(b) R is a unique factorisation domain if it has the following properties: 

• every element a E R which is not zero or a unit can be written as a 
product of irreducibles; 

• if p m7 gi , . . . , g„ are irreducibles and 

piP2---Pm = qiq2---q n , 

then m = n and, after possibly permuting the factors in one product, pi 
and qi are associates for i = 1 , . . . , m. 

Note that, if an element p is irreducible, then so is every associate of p. If 
the second condition in the definition of a unique factorisation holds, we say that 
"factorisation is unique up to order and associates". As we saw, this is the best we 
can expect in terms of unique factorisation! 

The ring Z is a unique factorisation domain; so is the ring F[x] of polynomials 
over any field F. We will prove these things later on; we will see that it is the Eu- 
clidean algorithm which is crucial to the proof, and the integers and polynomials 
over a field both have a Euclidean algorithm. 

Note that, to decide whether a ring is a unique factorisation domain, we have 
first to check that it really is an integral domain, and second to find all the units 
(so that we know when two elements are associates). 
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Example Here is an example of a ring which is an integral domain but not a 
unique factorisation domain. Let 

R = {a + bV^5:a,beZ}. 

We show first that R is a subring of C. Take two elements of R, say r = a + by^5 
and s = c + Jv / — 5, with a, b,c,d G Z. Then 

r-s = (a — c) + (b — d) \/—5 G R, 
rs = (ac — 5bd) + (ad + be) V—5 G R 7 

since a — c,b — d,ac — 5bc, ad + bc G Z. So the Subring Test applies. 

R is clearly an integral domain: there do not exist two nonzero complex num- 
bers whose product is zero. 

What are the units of R1 To answer this, we use the fact that \a + b\f^ 5| 2 = 
a 2 + 5b 2 . Now suppose that a + by^5 is a unit, say 

(a + by/^5)(c + d\/^5) = 1. 

Taking the modulus and squaring gives 

(a 2 + 5b 2 )(c 2 + 5d 2 ) = 1. 

So a 2 + 5b 2 = 1 (it can't be — 1 since it is positive). The only solution is a = ±1, 
b = 0. So the only units are ±1, and so r is associate only to r and — r. 
Now we show that 2 is irreducible. Suppose that 

2 = (a + bV^5)(c + dV^5). 

Taking the modulus squared again gives 

4 = (a 2 + 5b 2 )(c 2 + 5d 2 ). 

So a 2 + 5b 2 = 1, 2 or 4. But the equation a 2 + 5b 2 = 2 has no solution, while 
a 2 + 5b 2 = 1 implies a = ±1, b = 0, and a 2 + 5b 2 = 4 implies c 2 + 5d 2 = 1, so 
that c = ±1, d = 0. So the only factorisations are 

2 = 2-1 = 1-2 = (-2)-(-l) = (-2)-(-l) : 

in each case, one factor is a unit and the other is an associate of 2. 

In a similar way we can show that 3,1 + v^— 5 and 1 — \/— 5 are irreducible. 
Now consider the factorisations 

6 = 2-3 = (l + v /= 5)(l-v /= 5). 

These are two factorisations into irreducibles, which are not equivalent up to order 
and associates. So R is not a unique factorisation domain! 
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2.4.3 Principal ideal domains 

Let R be a commutative ring with identity. We denote by aR, or by (a), the set 
{ar : r e R} of all elements divisible by a. 

Lemma 2.23 (a) is an ideal ofR containing a, and if I is any ideal ofR containing 
a then (a) C /. 

Proof We apply the Ideal Test. If ar\, ar2 G (a), then 

ar\ —ar2 = a{r\ — r^) G (a). 
Also, if ar G (a) and x G R, then 

(ar)x = a(rx) G (a). 

So (I) is an ideal. 

Since R has an identity element 1, we have a = a\ G (a). 

Finally, if / is any ideal containing a, then (by definition of an ideal) we have 
ar E I for any r G R; that is, (a) C /. 

Lemma 2.24 Let R be an integral domain. Then (a) = (b) if and only if a and b 
are associates. 

Proof (a) = (b) means, by definition, that each of a and b is a multiple of the 
other, that is, they are associates. 

We call (a) the ideal generated by a and say that it is a principal ideal. 
More generally, if a\ , . . . , a n G R (where R is a commutative ring with identity, 
then we let 

(ai,...,a n ) = {naiH Yr n a n :n,...,r n eR}. 

Then it can be shown, just as above, that {a\,...,a n ) is an ideal of R contain- 
ing ai,...,a„, and that any ideal which contains these elements must contain 
(a i , . . . , a n ) . We call this the ideal generated by a\, . . . ,a n . 

A ring R is a principal ideal domain if every ideal is principal. We will see 
later that Z is a principal ideal domain. 

Proposition 2.25 Let R be a principal ideal domain. Then any two elements ofR 
have a greatest common divisor; in fact, d = gcd(a, b) if and only if (a,b) = (d). 

Proof Suppose that R is a principal ideal domain. Then (a, b) , the ideal generated 
by a and b, is a principal ideal, so it is equal to (d) , for some d G R. Now we claim 
that d = gcd(a,b). 
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• a E (d), so d \ a. Similarly d \ b. 

• Also, d E (a,b), so d = ua + vb for some w,v G R. Now suppose that e | a 
and e | I?, say a = ep and & = eg. Then d = ua + vb = e(up + vq), so that 
e | J. 

The claim is proved. 

Since any two gcds of a and b are associates, and any two generators of (a, b) 
are associates, the result is proved. 

Example The ring Z is a principal ideal domain. That means that the only ideals 
in Z are the sets (n) = nZ, for n E Z. We will deduce this from a more general 
result in the next section. 

Now it is the case that any principal ideal domain is a unique factorisation 
domain. We will not prove all of this. The complete proof involves showing two 
things: any element which is not zero or a unit can be factorised into irreducibles; 
and any two factorisations of the same element differ only by order and associates. 
We will prove the second of these two assertions. See the appendix to this chapter 
for comments on the first. 

Lemma 2.26 Let R be a principal ideal domain; let p be irreducible in R, and 
a,b e R. If p | ab, then p \ a or p \ b. 

Proof Suppose that p | ab but that p does not divide a. Then we have gcd(a, p) = 
1, and so there exist u,v <ER with 1 = ua + vp. So b = uab + vpb. But p | uab by 
assumption, and obviously p | vpb; so p | b, as required. 

This lemma clearly extends. If p is irreducible and divides a product a\at ■■■a n , 
then p must divide one of the factors. For either p \ a\ or p \ a2 ■ ■ ■ a n ; in the latter 
case, proceed by induction. 

Theorem 2.27 Let R be a principal ideal domain, and suppose that 

a = piP2---Pm = qiq2---q n , 

where p\ , . . . , p m , q\ , . . . , q n are irreducible. Then m = n and, after possibly per- 
muting the factors, pi and qi are associates for i = 1 , . . . , m. 

Proof Obviously p\ divides q\ ■ - -qn, so p\ must divide one of the factors, say 
p\ | Since p\ and qi are irreducible, they must be associates. By permuting the 
order of the qs and adjusting them by unit factors, we can assume that p\ = q\. 
Then p2- ■ • p m = <?2 • • -qn, and we proceed by induction. 
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Example Here is an example of an integral domain which is not a principal ideal 
domain. Consider the ring R = Z[x] of polynomials over the integers. Let / be the 
set of all such polynomials whose constant term is even. Then I is an ideal in R: if 
/ and g are polynomials with even constant term, then so is / — g, and so is fh for 
any polynomial h. But / is not a principal ideal. For / contains both the constant 
polynomial 2 and the polynomial x of degree 1. If / = (a), then a must divide both 
2 and*, so a = ±1. But ±1 /. 

The polynomials 2 and x are both irreducible in R, and so their gcd is 1 . But 1 
cannot be written in the form 2u + xv for any polynomials u and v. 

The ring Z[x] is a unique factorisation domain (see the Appendix to this chap- 
ter). 

2.4.4 Euclidean domains 

Any two integers have a greatest common divisor, and we can use the Euclidean 
algorithm to find it. You may also have seen that the Euclidean algorithm works 
for polynomials. We now give the algorithm in a very general form. 

Let R be an integral domain. A Euclidean function on R is a function d from 
the set R \ {0} (the set of non-zero elements of R) to the set N of non-negative 
integers satisfying the two conditions 

(a) for any a,b ER with a,b^0, we have d{ab) > d(a) ; 

(b) for any a,b ER with b ^ 0, there exist q,r ER such that 

• a = bq + r; 

• either r = 0, or d(r) < d(b). 

We say that an integral domain is a Euclidean domain if it has a Euclidean func- 
tion. 

Example Let R = Z, and let d(a) = \a\ for any integer a. 

Example Let R = F[x], the ring of polynomials over F, where F is a field. For 
any non-zero polynomial f(x), let d(f(x)) be the degree of the polynomial f(x) 
(the index of the largest non-zero coefficient). 

Both of these examples are Euclidean functions. 

(a) In the integers, we have d(ab) = \ab\ = \a\ ■ \b\ > \a\ = d(a), since b ^ 0. 
In the polynomial ring F[x], we have 

d{ab) = deg(ab) = deg(a) +deg(£) > deg(a), 
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since if the leading terms of a and b are a n x n and b m x m respectively then the 
leading term of ab is a n b m x n+m . 

(b) In each case this is the "division algorithm": we can divide a by b to obtain a 
quotient q and remainder r, where r is smaller than the divisor b as measured 
by the appropriate function d. 

You will have seen how to use the Euclidean algorithm to find the greatest 
common divisor of two integers or two polynomials. The same method works in 
any Euclidean domain. It goes like this. Suppose that R is a Euclidean domain, 
with Euclidean function d. Let a and b be any two elements of R. If b = 0, then 
gcd(a,&) = a. Otherwise, proceed as follows. Put a = ao and b = a\. If a,[-\ and 
at have been constructed, then 

• if cii = then gcd(a,&) = a;_i; 

• otherwise, write = aiq + r, with r = or d{r) < d(ai), and set a,-+i = r; 
repeat the procedure for a,- and a i+ \. 

The algorithm terminates because, as long as ai ^ 0, we have 



Since the values of d are non-negative integers, this chain must stop after a finite 
number of steps. 

To see that the result is correct, note that, if a = bq + r, then 



(as an easy calculation shows: the common divisors of a and b are the same as the 
common divisors of b and r. So we have gcd(a;_i,a;) = gcd(a,b) as long as a,- is 
defined. At the last step, at = and so gcd(a,&) = gcd(a,-_i,0) = a,-_i. 

The algorithm can also be used to express gcd(a,&) in the form ua + vb for 
some u,v E R. For a and b themselves are both expressible in this form; and, if 
= Ui-ia + Vj-ib and a,- = uia + Vib, then with a^-i = ^a,- + a,-+i, we have 



< rf(a ( _i) < ■ • • < rf(ai). 



gcd(a,Z>) =gcd(&,r) 



flf+l = flf-l - tffl/ = ("i-l - + (v;-i - #v,-)fc. 



Example Find gcd(204, 135). We have 



204 



135 



66 



69 



135-1 + 69, 
69-1 + 66, 
66-1 + 3, 
3-22, 



48 



CHAPTER 2. RINGS 



so gcd(204, 135) = 3. To express 3 = 204m + 135v, we have 

69 = 204- 1 -135-1, 
66 = 135-69 = 135-2-204-1, 
3 = 69-66 = 204-2-135-3. 

We will show that a Euclidean domain is a unique factorisation domain. First 
we need one lemma. Note that, if a and b are associates, then b = au, so d(b) > 
d(a), and also a = bu~ l , so d(a) > d(b); so we have d(a) = d(b). 

Lemma 2.28 Let R be a Euclidean domain. Suppose that a and b are non-zero 
elements ofR such that a \ b and d(a) = d(b). Then a and b are associates. 

Proof Let a = bq + r for some q, r, as in the second part of the definition. Sup- 
pose that r^O. Now b = ac for some element c; so a = acq + r. Thus, r = 
a(l — cq), and since r^Owe have d(r) > d(a), contrary to assumption. So r = 0. 
Then b \ a; since we are given that a \ b, it follows that a and b are associates. 

Theorem 2.29 (a) A Euclidean domain is a principal ideal domain, 
(b) A Euclidean domain is a unique factorisation domain. 

Proof (a) Let R be a Euclidean domain, and let / be an ideal in R. If I = {0}, 
then certainly / = (0) and / is principal. So suppose that / is not {0}. Since the 
values of d(x) for x El are non-negative integers, there must be a smallest value, 
say d(a). We will claim that / = (a). 

First, take be (a), say b = ax. Then b E /, by definition of an ideal. 

Next, take b El. Use the second part of the definition of a Euclidean function 
to find elements q and r such that b = aq + r, with either r = or d(r) < d(a). 
Suppose that r ^ 0. Then b El and aq E 1 ', so r = b = aq E T, but d(r) < d{a) 
contradicts the fact that d(a) was the smallest value of the function d on the non- 
zero elements of /. So the supposition is impossible; that is, r = 0, and b = aq E 
(a). 

So I = (a) is a principal ideal. 

(b) Again let R be a Euclidean domain. We show that any nonzero non-unit 
of R can be factorised into irreducibles. We showed in the last section that the 
factorisation is unique (because R is a principal ideal domain) 

Choose any element a E R such that a ^ and a is not a unit. We have to 
show that a can be factorised into irreducibles. The proof is by induction on d(a); 
so we can assume that any element b with d(b) < d(a) has a factorisation into 
irreducibles. 
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If a is irreducible, then we have the required factorisation with just one term. 
So suppose that a = bc where b and c are not units. If d{b) < d(a) and d{c) < d(a) 
then, by induction, each of b and c has a factorisation into irreducibles; putting 
these together we get a factorisation of a. So suppose that d(a) > d(b). We also 
have d(b) > d(a), by the first property of a Euclidean function; so d(a) = d(b). 
We also have b | a; by the Lemma before the Theorem, we conclude that a and b 
are associates, so that c is a unit, contrary to assumption. 

Corollary 2.30 (a) Z is a principal ideal domain and a unique factorisation 
domain. 

(b) For any field F, the ring F[x] of polynomials over F is a principal ideal 
domain and a unique factorisation domain. 

Proof This follows from the theorem since we have seen that these rings are 
integral domains and have Euclidean functions, and so are Euclidean domains. 

2.4.5 Appendix 

More is true than we have proved above. You will meet these theorems in the 
Algebraic Structures II course next term. 

The connection between the three types of domain is: 

Theorem 2.31 

Euclidean domain =>- principal ideal domain =>- unique factorisation domain. 

We proved most of this: we showed that a Euclidean domain is a principal 
ideal domain, and that in a principal ideal domain factorisations are unique if they 
exist. The proof that factorisations into irreducibles always exist in a principal 
ideal domain is a little harder. 

Neither implication reverses. We saw that Z[x] is not a principal ideal domain, 
though it is a unique factorisation domain (see below). It is harder to construct a 
ring which is a principal ideal domain but not a Euclidean domain, though such 
rings do exist. 

Another way to see the increasing strength of the conditions from right to left 
is to look at greatest common divisors. 

• In a unique factorisation domain, any two elements a and b have a greatest 
common divisor d (which is unique up to associates). 

• In a principal ideal domain, any two elements a and b have a greatest com- 
mon divisor d (which is unique up to associates), and d can be written in 
the form d = xa + yb. 
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• In a Euclidean domain, any two elements a and b have a greatest common 
divisor d (which is unique up to associates), and d can be written in the form 
d = xa+yb; moreover, the gcd, and the elements x and y, can be found by 
the Euclidean algorithm. 

You will also meet the theorem known as Gauss's Lemma: 

Theorem 2.32 IfR is a unique factorisation domain, then so is R[x\. 

This result shows that Z[x] is a unique factorisation domain, as we claimed 
above. 

2.5 Fields 

As you know from linear algebra, fields form a particularly important class of 
rings, since in linear algebra the scalars are always taken to form a field. 

Although the ring with a single element would technically qualify as a field 
according to our definition, we always rule out this case. Thus, 

Afield must have more than one element. 

Another way of saying the same thing is that, in a field, we must have 1^0. (If 
there is any element x ^ in a ring with identity, then l-x = x^0 = 0-x, and so 
1^0.) 

The "standard" examples of fields are the rational, real and complex numbers, 
and the integers mod p for a prime number p. 

In this chapter, we will see how new fields can be constructed. The most im- 
portant method of construction is adjoining a root of a polynomial. The standard 
example of this is the construction of C by adjoining the square root of —1 (a 
root of the polynomial x 2 + 1 = 0) to R. We will also see that finite fields can be 
constructed in this way. 

Also we can build fields as fields of fractions; the standard example is the 
construction of the rationals from the integers. 

2.5.1 Maximal ideals 

In this chapter, R always denotes a commutative ring with identity. As above, we 
assume that the identity element 1 is different from the zero element 0: that is, 
0^1. 

An ideal / of R is said to be proper if / ^ R. An ideal / is maximal ifl^R and 
there does not exist an ideal J with / C J C R; that is, any ideal J with / C7Ci? 
must satisfy J = I or J = R. 
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Lemma 2.33 Let R be a commutative ring with identity. Then R is afield if and 
only if it has no ideals except {0} andR. 

Proof If u E R is a unit, then the only ideal containing u is the whole ring R. (For, 
given any ideal / with w6/, and any r E R, we have r = u(u~ l r) E I, so I = R.) If 
R is a field, then every non-zero element is a unit, and so any ideal other than {0} 
is R. 

Conversely, suppose that the only ideals are and R. We have to prove that 
multiplicative inverses exist (axiom (M3)). Take any element a E R with a^O. 
Then (a) = R, so 1 G (a). This means that there exists b G R with ab = 1, so 
b = aT x as required. 

Proposition 2.34 Let F be a commutative ring with identity, and I a proper ideal 
ofR. Then R/I is afield if and only if I is a maximal ideal. 

Proof By the Second Isomorphism Theorem, ideals of R/I correspond to ideals 
of R containing /. Thus, / is a maximal ideal if and only if the only ideals of R/I 
are zero and the whole ring, that is, R/I is a field (by the Lemma). 

Proposition 2.35 Let R be a principal ideal domain, and I = (a) an ideal of R. 
Then 

(a) I = R if and only if a is a unit; 

(b) I is a maximal ideal if and only if a is irreducible. 



Proof (a) If a is a unit, then for any r G R we have r = a(a~ l r) G (a), so (a) = R. 
Conversely, if (a) = R, then 1 = ab for some b G R, and a is a unit. 

(b) Since R is a PID, any ideal containing (a) has the form (b) for some b ER. 
Moreover, (a) C (b) if and only if b \ a. So (a) is maximal if and only if, whenever 
b | a, we have either b is a unit (so (b) =R) orb is an associate of a (so (b) = (a). 

Corollary 2.36 Z/nZ is afield if and only ifn is prime. 

Proof Z is a principal ideal domain, and irreducibles are just the prime integers. 
The field Z/ pZ, for a prime number p, is often denoted by ¥ p . 
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2.5.2 Adding the root of a polynomial 

The other important class of principal ideal domains consists of the polynomial 
rings over fields. For these, Propositions 2.34 and 2.35 give the first part of the 
following result. 

Proposition 2.37 Let F be a field and f(x) an irreducible polynomial over F. 
Then K = F[x]/(f(x)) is afield. Moreover, there is an isomorphism from F to a 
subfield of K; and, if a denotes the coset (f(x)) +x, then we have the following, 
where n is the degree of f(x), and we identify an element ofF with its image under 
the isomorphism: 

(a) every element ofk can be uniquely written in the form 

c + cia + c 2 a 2 H hc„_ia" _1 ; 

(b) f(a) = 0. 

Before proving this, we notice that this gives us a construction of the complex 
numbers; Let F = R, and let f{x) = x 2 + 1 (this polynomial is irreducible over W). 
Use the notation i instead of a for the coset (/(*)) +x. Then we have n = 2, and 
the two parts of the proposition tell us that 

(a) every element of K can be written uniquely as a + bi, where a, b E K; 

(b) i 2 = -l. 

Thus, K = M.[x]/ (x 2 + 1) is the field C. The general theory tells us that this con- 
struction of C does produce a field; it is not necessary to check all the axioms. 

Proof (a) Let / denote the ideal (f(x)}. Remember that the elements of the 
quotient ring F[x]/I are the cosets of / in F[x\. The isomorphism from F to 
K = F [x] 1 1 is given by 

a6 = I + a for a E F. 

Clearly 6 is one-to-one; for if a6 = b6, then b — a El, but / consists of all mul- 
tiples of the irreducible polynomial f(x), and cannot contain any constant poly- 
nomial except 0, so a = b. It is routine to check that preserves addition and 
multiplication. From now on, we identify a with the coset I + a, and regard F as 
a subfield of F[x]/I. 

Let g(x) E F[x\. Then by the Euclidean algorithm we can write 



g (x) =f(x)q(x) + r(x), 
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where r(x) = or r{x) has degree less than n. Also, since g(x) — r(x) is a multiple 
of f(x), it belongs to I, and so the cosets I + g(x) and I + r{x) are equal. In other 
words, every coset of I in F[x] has a coset representative with degree less than n 
(possibly zero). This coset representative is unique, since the difference between 
any two coset representatives is a multiple of f(x) . 

Now let r{x) = cq + c\x + C2X 2 H h c n -\x n ~ l . We have 

I + r(x) = 1+ (c + cix + c 2 x 2 -\ hc„_ix" _1 ) 

= (7 + c ) + (/ + d)(/ + x) + (I + c 2 )(I + x) 2 + ■ ■ ■ + (I + c n ^)(I + x)"- 1 ) 
= c + cia + c 2 a 2 -\ hc„_ia' I_1 . 

Here, in the second line, we use the definition of addition and multiplication of 
cosets, and in the third line we put I+x = a and use our identification of /+ c = cO 
with c for c G F. 

So we have the required representation. Clearly it is unique. 

(b) As before, if f(x) = ciq + a\x-\ \-a n x n , we have I + f(x) = I (since 

f(x) G /), and so 

= 7 + 

= I+(ao + a[X-\ \-a n x") 

= (I + a ) + (I + a 1 )(I+x) + - •• + (/ + a n )(I+x) n 

= ao + ai(X-\ Ya n a n 

= /(«)• 

2.5.3 Finite fields 

Suppose that f(x) is an irreducible polynomial of degree n over the field ¥ p of 
integers mod p. Then K = ¥ p [x]/ (/(*)) is a field, by Proposition 2.37. According 
to that proposition, its elements can be written uniquely in the form 

co + ciCt-\ hc„_ia" _1 

for co, ... , c n - 1 G Fp. There are p choices for each of the n coefficients cq , c\ , . . . , c M _ i , 
giving a total of p n elements altogether. Thus: 

Proposition 2.38 Let f(x) be an irreducible polynomial of degree n over ¥ p . 
Then K = ¥ p [x]/{f(x)) is afield containing p n elements. 
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Example Let p = 2 and n = 2. The coefficients of a polynomial over F2 must 
be or 1, and so there are just four polynomials of degree 2, namely x 2 , x 2 + 1, 
x 2 + x and x 2 + x + 1 . We have 

x 2 =x-x, x 2 + x = x- (x+ 1), x 2 + 1 = (x+ 1) ■ (x+ 1) 

(remember that 1 + 1 = in F2!), and so the only irreducible polynomial is x 2 + 
x+l. Thus, there is a field consisting of the four elements 0, 1, Of, 1 + a, in which 
a 2 + a + \ = 0, that is, a 2 = 1 + a (since -1 = +1 in F 2 !) The addition and 
multiplication tables are easily found (with /3 = 1 + a) to be 



+ 
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a 


J8 
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a 


J8 
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a 


J8 
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1 
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a 


1 
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a 


J8 


a 
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J8 
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a 
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J8 


1 


P 


P 


a 


1 





J8 





P 


1 


a 



We have, for example, 

a + /3 = a+l + a = l, 
af3 = a(l + a) = a + a 2 = 1, 
/3 2 = (l + a) 2 = l + a = j8. 

The basic facts about finite fields were one 
of the discoveries of Evariste Galois, the 
French mathematician who was killed in a 
duel in 1832 at the age of 19. Most of his 
mathematical work, which is fundamental 
for modern algebra, was not published until 
fifteen years after his death, but the result on 
finite fields was one of the few papers 
published during his lifetime. 



Galois proved the following theorem: 

Theorem 2.39 The number of elements in a finite field is a power of a prime. For 
any prime power p n , there is a field with p" elements, and any two finite fields 
with the same number of elements are isomorphic. 

We commemorate Galois by using the term Galois field for finite field. If 
q = p n , then we often denote the field with q elements by GF(q). Thus the field 
on the preceding page is GF(4). (Note that GF(4) is not the same as Z/4Z, the 
integers mod 4, which is not a field!) 




2.5. FIELDS 



55 



2.5.4 Field of fractions 

In this section we generalise the construction of the rational numbers from the 
integers. [This section and the two following were not covered in the lectures, but 
you are encouraged to read them for interest.] 

Theorem 2.40 Let R be an integral domain. Then there is afield F such that 

(a) R is a subring ofF; 

(b) every element ofF has the form ab~ , for a,b E R and b ^ 0. 

The field F is called the field of fractions of R, since every element of F can 
be expressed as a fraction a/b. 

We will build F as the set of all fractions of this form. But we have to answer 
two questions? 

• When are two fractions equal? 

• How do we add and multiply fractions? 

Thus, we start with the set X consisting of all ordered pairs (a, b), with a,beR 
and b ^ 0. (That is, X = R x (R \ {0}).) The ordered pair (a,b) will "represent" 
the fraction a/b. So at this point we have to answer the first question above: 
when does a/b = c/dl Multiplying up by bd, we see that this holds if and only if 
ad = be. Thus, we define a relation ~ on X by the rule 

(a,b) ~ (c,d) if and only if ad = be. 

We have to show that this is an equivalence relation. 

reflexive: ab = ba, so (a,b) ~ (a,b). 

symmetric: If (a,b) ~ (c, d), then ad = be, so cb = da, whence (c, d) ~ (a,b). 

transitive: Suppose that (a,b) ~ (c,d) and (c,d) ~ (<?,/). Then ad = be and 
cf = de. So adf = bef = bde. This means that d(af — be) = 0. But 
d 7^ and R is an integral domain, so we conclude that af = be, so that 

(a,b) ~ (<?,/). 

Now we let F be the set of equivalence classes of the relation ~. We write the 
equivalence class containing (a, b) as a/b. Thus we do indeed have that a/b = c/d 
if and only if ad = bc. 

Now we define addition and multiplication by the "usual rules": 
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• (a/b) + (c/d) = (ad + bc)/(bd); 

• (a/b)(c/d) = (ac)/(bd). 

(To see where these rules come from, just calculate these fractions in the usual 
way!) Again, since b ^ and d ^ 0, we have bd ^ 0, so these operations make 
sense. We still have to show that they are well-defined, that is, a different choice 
of representatives would give the same result. For addition, this means that, if 
(a,b) ~ (a',b f ) and (c,d) ~ (c',d f ), then (ad + bc,bd) ~ (a'd' + b'c' \b'd'). Trans- 
lating, we have to show that 

if ab' = bd and cd' = dc', then (ad + bc)b'd' = bd(a'd' + b'c') , 

a simple exercise. The proof for multiplication is similar. 

Now we have some further work to do. We have to show that 

• F, with addition and multiplication defined as above, is a field; 

• the map 9 defined by a0 = a/ 1 is a homomorphism from RtoF, with kernel 
{0} (so that R is isomorphic to the subring {a/\ : a G R} of F). 

These are fairly straightforward to prove, and their proof finishes the theorem. 

2.5.5 Appendix: Simple rings 

We saw at the start of this chapter (Lemma 2.33) that, if R is a commutative 
ring with identity having no ideals except the trivial ones, then R is a field. You 
might think that, if we simply leave out the word "commutative", then we obtain a 
characterisation of division rings. Unfortunately this is not so. The material here 
is not part of the course; you can find a proof in the course textbook if you are 
interested. Let R be a ring with identity. We say that R is a simple ring if the only 
ideals in R axe {0} and R. Then every division ring (and in particular every field) 
is a simple ring, and our earlier argument shows that a commutative simple ring 
is a field. But we have the following fact: 

Theorem 2.41 Let R be a simple ring (with identity). Then the ring M n (R) of 
nx n matrices over R is a simple ring. 

In particular, the ring ofnxn matrices over a field F is a simple ring, although 
it is not commutative and is not a division ring for n > 1. 
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2.5.6 Appendix: The number systems 

This section is not part of the course and is just for general interest. How do we 
build the number systems Z, Q, R and C? 

I'll leave out the construction of Z. 
Kronecker said, "God made the integers; the 
rest is the work of man", and though we do 
now know how to construct the integers 
(starting with nothing but the empty set), it is 
not straightforward. 



We construct Q as the field of fractions of Z. 

To construct R from Q, we borrow an idea from analysis, the definition of 
a Cauchy sequence: the sequence (ao,«i,«2, • • •) is a Cauchy sequence if, given 
any e > 0, there exists a positive integer N such that, for all m,n> N, we have 

\cifn (Xn | <C £. 

We let R be the set of all Cauchy sequences of rational numbers. We make 
R into a ring by adding and multiplying sequences term by term. Then let / be 
the set of all null sequences of rational numbers (sequences which converge to 0.) 
Then it can be shown that R is a commutative ring with identity, and / a maximal 
ideal; so R/I is a field. This is the field R of real numbers. 

We saw that C is constructed from R by adding a root of the irreducible poly- 
nomial x 2 + 1: that is, C = R[x]/(x 2 + 1). 
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Chapter 3 
Groups 



In the remainder of the notes we will be talking about groups. A group is a struc- 
ture with just one binary operation, satisfying four axioms. So groups are only 
half as complicated as rings! As well as being new material, this part will help 
you revise the first part of the course, since a lot of things (subgroups, homo- 
morphisms, Isomorphism Theorems) work in almost exactly the same way as for 
rings. 

3.1 Introduction 

3.1.1 Definition of a group 

A group is a set G with one binary operation (which we write for now as o in infix 
notation 1 ) satisfying the following four axioms (G0)-(G3): 

(GO) (Closure law) For any g,h EG, we have goh EG. 

(Gl) (Associative law) For any g,h,k E G, we have {goh) ok = go (hok). 

(G2) (Identity law) There is an element e EG with the property that goe = eog = 
g for all g EG. (The element e is called the identity element of G.) 

(G3) (Inverse law) For any element g E G, there is an element h EG satisfying 
goh = ho g = e. (We denote this element h by and call it the inverse 
ofg.) 

If a group G also satisfies the condition 

'Remember that this means that the result of applying the operation to a and b is written as 
aob. 
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(G4) (Commutative law) For any g,h e G, we have goh = ho g, 

then G is called a commutative group or (more often) an Abelian group. 

3.1.2 Examples of groups 

Axioms (G0)-(G4) for a group are just axioms (A0)-(A4) for a ring but using 
slightly different notation (the set is G instead of R, the operation is o instead of 
+, and so on). So we get our first class of examples: 

Proposition 3.1 Let R be a ring. Then R with the operation of addition is an 
Abelian group: the identity element is 0, and the inverse of a is —a. 

This group is called the additive group or R. 
This is not the only way to get groups from rings. 

Proposition 3.2 Let R be a ring with identity, and U (R) the set of units ofR. Then 
U (R), with the operation of multiplication, is a group. IfR is a commutative ring, 
then U (R) is an Abelian group. 

This group is called the group of units of R. 

Proof Look back to Lemma 2.20. Let U (R) be the set of units of R. 

(GO) Lemma 5.1(c) shows that, if u and v are units, then so is uv. So U(R) is 
closed for multiplication. 

(Gl) The associative law for multiplication holds for all elements of R (by Ax- 
iom (Ml) for rings), and so in particular for units. 

(G2) Lemma 5.1(a) shows that 1 is a unit. It clearly plays the role of the identity 
element. 

(G3) Lemma 5.1(b) shows that, if u is a unit, then so is u~ l . 

(G4) For the last part of the Proposition, if R is a commutative ring, then (M4) 
holds, so that uv = vu for all w, v G R; in particular, this holds when u and v 
are units. 

Example If F is a field, then every non-zero element of F is a unit. So the set of 
non-zero elements forms an Abelian group with the operation of multiplication. 
This is called the multiplicative group of the field. 
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Example A matrix A is a unit in M n (F), where F is a field, if and only if 
det(A) 7^ 0. So the set of matrices with non-zero determinant is a group. This 
group is called the general linear group, and written GL(n,F). If n > 1, this 
group is not Abelian. 

Direct product This construction corresponds to the direct sum for rings. 
Let Gi and G2 be groups. The direct product G\ x G2 is defined as follows: 

• the set of elements is the Cartesian product (which is also denoted by G\ x 
G2), the set of all ordered pairs {g\,gi) with gi e G\ and g2 G G2; 

• the group operation is "componentwise", that is, 

(ghg2)°(hhh 2 ) = (gioh 1 ,g 2 oh 2 ). 

It is an exercise to prove that it is a group. 

Cayley tables As with any binary operation, the group operation can be repre- 
sented by giving an operation table. In the case of a group, the operation table is 
usually called the Cayley table of the group. In principle, given the Cayley table 
of a finite group, we could check that all the group axioms are satisfied. 

Here, for example, are the Cayley tables of two groups each with four ele- 
ments. 






e 


X 


y 


z 





e 


a 


b 


c 


e 


e 


X 


y 


z 


e 


e 


a 


b 


c 


X 


X 


y 


z 


e 


a 


a 


e 


c 


b 


y 


y 


z 


e 


X 


b 


b 


c 


e 


a 


z 


z 


e 


X 


y 


c 


c 


b 


a 


e 



Each group is Abelian, as we can see because the tables are symmetric. These 
two groups are obviously different: in the second group, each element is equal to 
its inverse, whereas this is not true in the first group. (When we come to define 
isomorphism, we will say that the two groups are not isomorphic.) 

In fact, these groups are the additive groups of the rings Z/4Z and GF(4) 
respectively. 

3.1.3 Properties of groups 

Some of these properties will look very familiar, since they are similar to what we 
saw for rings. 
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Uniqueness of identity element 

The identity element of a group is unique. For suppose that there are two 
identity elements, say e\ and (This means that goe\ = e\ og = g for all g, and 
also g o e 2 = e 2 ° g = g for all g.) Then 

<?i = e\ oe 2 = <? 2 . 



Uniqueness of inverse 

The inverse of a group element g is unique. For suppose that h and k are both 
additive inverses of g. (This means that goh = ho g = e and gofc = fcog = e-we 
know now that there is a unique identity element e). Then 

h = hoe = ho(gok) = (hog)ok = eok = k, 

where we use the associative law in the third step. 
We denote the inverse of g by g~ l . 

Composing more than two elements 

We showed in Proposition 2.6 that, as long as the associative law holds, the 
result of composing any number of elements is independent of the way that the 
product is bracketed: for example, a o ((b o c) o d) = (a o b) o (c o d). Since the 
associative law holds in a group, we have: 

Proposition 3.3 Let gi,. . .,g n be elements of a group G. Then the composition 

81 °g2°---°gn 
is well-defined, independent of the way it is bracketed. 

Cancellation laws 

Proposition 3.4 In a group G, if a og = bog, then a = b. Similarly, ifg o a = g o b, 
then a = b. 

Proof Suppose that aog = bog, and let h = g~ 1 . Then 

a = aoe = ao( K goh) = (aog)oh = (bog)oh = bo( K goh)=boe = b. 
The other law is proved similarly. 
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These facts are the cancellation laws. 

Proposition 3.5 The inverse of g oh is h^ 1 o g~ l . 
Proof 

(goh)o(h~ l og- 1 ) = go(hoh~ l )og~ 1 =goeog- 1 =gog~ l = e , 
using the associative law; and similarly (h^ 1 og~ l )o(g°h) = e. 

3.1.4 Notation 

The notation g o h for the group operation is a bit cumbersome, and we now change 
things. 

If we are only interested in Abelian groups, we use + as the symbol for the 
group operation, for the group identity, and — g for the inverse of g. This agrees 
with the additive notation in a ring. Indeed, the additive group of a ring is an 
Abelian group, and every Abelian group is the additive group of a ring. [To see 
this, take the group operation as addition, and construct the zero ring: all products 
are zero.] 

For general groups which may not be Abelian, we use juxtaposition for the 
group operation, 1 for the identity, and g~ l for the inverse of g. (This is like 
multiplicative notation in a ring, but it is not true that every group is the group of 
units in some ring!!) 

This table gives the correspondences. 



Type of group 


Operation 


Notation 


Identity 


Inverse 


General 


o 


goh 


e 


s- 1 


Abelian 


+ 


g + h 





-g 


General 


Juxtaposition 


gh 


1 


8- 1 



For the rest of this course, our notation for the group operation will be juxta- 
position. 

3.1.5 Order 

The term order has two quite different meanings in group theory: be careful not 
to confuse them. In the next chapter we will see that there is a close relationship 
between the two meanings. 

The order of a group is the number of elements of the group. It may be finite 
(in which case it is a positive integer), or infinite. 
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To define the second kind of order, we introduce the notation g n . This means 
the result of composing n factors g together: 

g n = gg---g(n factors). 

More formally, g° = 1, and for any positive integer n, g n = g- g n ~ l . 
The order of an element g in a group is defined as follows: 

• If g n = 1 for some positive integer n, then the smallest such n is called the 
order of g. 

• If no such n exists, we say that g has infinite order. 

Thus, the identity element always has order 1 . If an element g has order 2, then 
it is equal to its inverse (for g 2 = 1 = gg^ 1 implies g = g~ l by the Cancellation 
Law.) 

Consider the additive group of the ring Z. (Recall that the operation is + and 
the zero element is 0; so, instead of g n we write n ■ g, and the order is the smallest 
positive n such that n ■ g = 0, or is infinite if no such n exists.) The element 1 has 
infinite order, since there is no positive integer n such that n ■ 1 = 0. 

In the first group in our two examples above of Cay ley tables, the elements x 
and z have order 4 (we have x 2 = y, x 3 = z, x 4 = e which is the identity element), 
while y has order 2. In the second group, all of a, b, c have order 2. 

3.1.6 Symmetric groups 

We end this chapter by defining an important class of groups. 

Let X be any set. A permutation of X is a function g : X — > X which is one-to- 
one and onto, that is, a bijection from X to X. Recall the discussion of permuta- 
tions in Chapter 1 . 

Let S n be the set of all permutations of the set {1, ... ,n}. We have 

\S n \ = n\ = n(n — l)(n — 2) • • • 1. 

For consider the two-line representation. The top row is (12 ... n). The bottom 
row consists of the same numbers in any order. Thus there are n possibilities for 
the first entry in the bottom row; n—l possibilities for the second (anything except 
the first), n — 2 possibilities for the third; and so on. 

Now we define an operation on permutations as follows. If g is a permutation, 
denote the image of the element x E { 1 , . . . , n} by xg. (As with homomorphisms, 
we write the function on the right of its input.) Now if g and h are two permuta- 
tions, their composition gig2 is defined by 



x(gh) = (xg)h for all x £ {!,... ,n}. 
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In other words the rule is "apply g, then h". 

For example, if g is the permutation (1,3, 5) (2, 4) (6) in our above example, 
and h = (1,2,3,4,5,6), then gh = (1,4,3,6)(2,5). You are strongly urged to 
practice composing permutations given in cycle form! 

Theorem 3.6 The set S n of permutations of {I, ... ,n}, with the operation of com- 
position, is a group. 

Proof (GO) If g and h are bijections, we have to show that gh is a bijection. 

- To show that it is one-to-one, suppose that x(gh) = y{gh). By defi- 
nition this means (xg)h = (yg)h. Since h is one-to-one, this implies 
xg = yg; then, since g is one-to-one, this implies x = y. 

- To show that it is onto, choose any element z G {1, . . . ,n}. Since h is 
onto, we can find y such that yh = z. Then since g is onto, we can find 
x such that xg = y. Then x{gh) = (xg)h = yh = z. 

(Gl) Let g, h,k be three permutations. To show that g(hk) = (gh)k, we have 
to show that these two permutations have the same effect on any element 
x G { 1 , . . . , n}. Now we have 

x(g(hk)) = (xg)(hk) = ((xg)h)k) = (x(gh))k = x((gh)k), 

as required. 

(G2) The identity permutation 1 is the permutation which leaves everything as it 
was: that is, x\ = x for all x G {1, . . . ,«}. Then x(lg) = (xl)g = xg for all 
x, so that Ig = g; similarly gl = g. 

(G3) The inverse of a permutation g is simply the "inverse function" which un- 
does the effect of g: that is, xg~ l = y if yg = x. Then it is clear that 

We call this group the symmetric group on n symbols, and denote it by S n . 
Proposition 3.7 S n is not Abelian for n>3. 

Proof If g = (1,2) and h = (1,3) (all other points are fixed), then gh = (1,2,3) 
but hg= (1,3,2). 

Exercise Show that 52 is an Abelian group. 
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Exercise Verify the following Cayley table for 53 : 
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(1,2,3) 


(1,3,2) 


(1,2) 


(2,3) 


(1,3) 


1 


1 


(1,2,3) 


(1,3,2) 


(1,2) 


(2,3) 


(1,3) 


(1,2,3) 


(1,2,3) 


(1,3,2) 
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(2,3) 


(1,3) 


(1,2) 


(1,3,2) 


(1,3,2) 


1 


(1,2,3) 


(1,3) 


(1,2) 


(2,3) 


(1,2) 


(1,2) 


(1,3) 


(2,3) 


1 


(1,3,2) 


(1,2,3) 


(2,3) 


(2,3) 


(1,2) 


(1,3) 


(1,2,3) 


1 


(1,3,2) 


(1,3) 


(1,3) 


(2,3) 


(1,2) 


(1,3,2) 


(1,2,3) 


1 



We end with a result which you probably met in Discrete Maths. 



Proposition 3.8 Let g be an element of S„, written in cycle notation. Then the 
order of g is the least common multiple of its cycle lengths. 

Proof Take any cycle of g, say of length k. Then the points in this cycle return 
to their original position after g has been applied k times. So g n fixes the points of 
this cycle if and only if n is a multiple of k. 

We deduce that g n = 1 if and only if n is a multiple of every cycle length. So 
the order of g is the least common multiple of the cycle lengths. 

So, of the elements of 53, the identity has order 1, the elements (1,2), (2,3) 
and (1,3) have order 2, and (1,2,3) and (1,3,2) have order 3. (Remember that 
(1,2) is really (1,2) (3), with a cycle of length 1; but this doesn't alter the least 
common multiple of the cycle lengths.) 

3.2 Subgroups 

This section corresponds to Section 2.2 on subrings. 

3.2.1 Subgroups and subgroup tests 

A subgroup of a group G is a subset of G which is a subgroup in its own right 
(with the same group operation). 

There are two subgroup tests, resembling the two subring tests. 

Proposition 3.9 (First Subgroup Test) A non-empty subset H of a group G is a 
subgroup ofG if for any h 7 k E H, we have hk EH and hr l EH. 
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Proof We have to show that H satisfies the group axioms. The conditions of 
the test show that it is closed under composition (GO) and inverses (G3). The 
associative law (Gl) holds in H because it holds for all elements of G. We have 
only to prove (G2), the identity axiom. 

We are given that H is non-empty, so choose h G H. Then by assumption, 
h~ l eH, and then (choosing k = h~ l ) 1 = hh~ l G H. 

We can reduce the number of things to be checked from two to one: 

Proposition 3.10 (Second Subgroup Test) A non-empty subset H of a group G 
is a subgroup of G if for any h,k G //, we have hk~ l G H. 

Proof Choosing k = h, we see that 1 = hh G H. Now using 1 and h in place 
of h and k, we see that hr x = \hr x G H. Finally, given h,k G H, we know that 
k~ l G H, so hk = h(k~ l )~ l G H. So the conditions of the First Subgroup Test 
hold. 



Example Look back to the Cayley tables in the last chapter. In the first case, 
{e,y} is a subgroup. In the second case, {e,a}, {e,b} and {e,c} are all subgroups. 



3.2.2 Cyclic groups 

If g is an element of a group G, we define the powers g n of G (for n G Z) as follows: 
if n is positive, then g n is the product of n factors g; g = 1; and g~ n = (g~ l ) n . 
The usual laws of exponents hold: g m+n = g m ■ g n and g mn = (g m ) n . 

A cyclic group is a group C which consists of all the powers (positive and 
negative) of a single element. If C consists of all the powers of g, then we write 
C = (g), and say that C is generated by g. 



Proposition 3.11 A cyclic group is Abelian. 

Proof Let C = (g). Take two elements of C, say g m and g M . Then 

g m -g n = g m + n = g n .g m . 

Let C = (g). Recall the order of g, the smallest positive integer n such that 
g n = 1 (if such n exists - otherwise the order is infinite). 



Proposition 3.12 Let g be an element of the a group G. Then the set of all powers 
(positive and negative) of g forms a cyclic subgroup ofG Its order is equal to the 
order of g. 
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Proof Let C = {g n : n E Z}. We apply the Second Subgroup test: if g m ,g n £ C, 
then (g m )(g n y l = g m ~ n EC. So C is a subgroup. 

If g has infinite order, then no positive power of g is equal to 1. It follows that 
all the powers g" for n e Z are different elements. (For if g m = g n , with m > n, 
then g"~ m = 1.) So C is infinite. 

Suppose that g has finite order n. We claim that any power of g is equal to one 
of the elements g° = l.g 1 = g, . . . ,g n ~ l . Take any power g m . Using the division 
algorithm in Z, write m = nq + r, where < r < n — 1 . Then 

g m = g nq+r =(g n ) q -g r =l-g r = g r . 

Furthermore, the elements . . . ,g are all different; for if g r = g s , with < 
r < s < n — 1, then g s ~ r = 1, and < s — r < n, contradicting the fact that n is the 
order of g (the smallest exponent i such that g 1 = 1). 

Example The additive group of the ring Z/nZ is a cyclic group of order n, 
generated by 1 = nZ + 1 . Remember that the group operation is addition here, 
and the identity element is zero, so in place of g n = 1 we have nl = 0, which is 
true in the integers mod n; moreover it is true that no smaller positive multiple of 
1 can be zero. 

Proposition 3.13 Let G be a cyclic group of finite order n. Then g has a cyclic 
subgroup of order mfor every m which divides n; and these are all the subgroups 
ofG 

Proof Let G = (g) = {l,g,g 2 , . . . .g"^ 1 }. If m divides n, let n = mk, and put 
h = g k . Then h m = (g k ) m = g n = 1, and clearly no smaller power of h is equal 
to 1; so h has order m, and generates a cyclic group of order m. 

Now let H be any subgroup of G. If H = {1}, then H is the unique cyclic 
subgroup of order 1 in G, so suppose not. Let g m be the smallest positive power 
of g which belongs to H. We claim that, if g k £ H, then m divides k. For let 
k = mq + r, where < r < m — 1. Then 

g r = g mi+r g- mq = g k (g m )- q eH, 

so r = (since m was the smallest positive exponent of an element of H. So H is 
generated by g m . Now g n = 1 e H, so m divides n, and we are done. 

3.2.3 Cosets 

Given any subgroup H of a group G, we can construct a partition of G into "cosets" 
of H, just as we did for rings. But for groups, things are a bit more complicated. 
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Because the group operation may not be commutative, we have to define two 
different sorts of cosets. 

Let H be a subgroup of a group G. Define a relation ~ r on G by the rule 

x ~ r y if and only if yx~ l E H. 

We claim that ~ r is an equivalence relation: 

reflexive: For any x E G, we have xx~ l = 1 G //, so ~ r x. 

symmetric: Suppose that x ~ r y, so that = yx~ l G //. Then hr l = (yx~ l )~ l = 
xy -1 E H, so y ~ r x. 

transitive: Suppose that x ~ r y and y ~ r z, so that h = yx~ 1 G H and = 1 G //. 

Then M = (zy^ 1 )(yx^ 1 ) = zx^ 1 G H, so x ~ r z. 

The equivalence classes of this equivalence relation are called the right cosets 
of H m G. 

A right coset is a set of elements of the form Hx = {hx : h G H}, for some 
fixed element x EG called the "coset representative". For 

y E Hx ^y = hx for some h E H <-> y^ 1 E H x ~ r y. 

We summarise all this as follows: 

Proposition 3.14 If H is a subgroup of the group G, then G is partitioned into 
right cosets ofH in G, sets of the form Hx = {hx : h E H}. 

In a similar way, the relation ~/ defined on G by the rule 

x ~/ y if and only if x~ l y EH 

is an equivalence relation on G, and its equivalence classes are the left cosets of H 
in G, the sets of the form xH = {xh : h E H}. 

If G is an abelian group, the left and right cosets of any subgroup coincide, 
since 

Hx = {hx:hEH} = {xh:hEH}= xH. 
This is not true in general: 
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Example Let G be the symmetric group 53 , and let H be the subgroup {1,(1,2)} 
consisting of all permutations fixing the point 3. The right cosets of H in G are 



HI = {1,(1,2)}, 

H(l,3) = {(1,3), (1,2,3)}, 

H(2,3) = {(2,3), (1,3,2), 

while the left cosets are 



ltf = {1,(1,2)}, 
(1,3)H={(1,3),(1,3,2)}, 
(2,3)H = {(2,3), (1,2,3)}. 

We see that, as expected, both right and left cosets partition G, but the two parti- 
tions are not the same. But each partition divides G into three sets of size 2. 



3.2.4 Lagrange's Theorem 

Lagrange's Theorem states a very important relation between the orders of a finite 
group and any subgroup. 

Theorem 3.15 (Lagrange's Theorem) Let H be a subgroup of a finite group G. 
Then the order ofH divides the order of G. 

Proof We already know from the last section that the group G is partitioned into 
the right cosets of H. We show that every right coset Hg contains the same number 
of elements as H. 

To prove this, we construct a bijection <j> from H to Hg. The bijection is 
defined in the obvious way: maps h to hg. 

• ^ is one-to-one: suppose that <j>(h\) = 0)%, that is, hig = h^g. Cancelling 
the g (by the cancellation law, or by multiplying by g^ 1 ), we get h\ = hi. 

• ^ is onto: by definition, every element in the coset Hg has the form hg for 
some h E H, that is, it is <j>(h). 

So is a bijection, and \Hg\ = \H\. 

Now, if m is the number of right cosets of H in G, then m\H\ = |G|, so \H\ 
divides \G\. 
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Remark We see that |G|/|/f| is the number of right cosets of H in G. This 
number is called the index of H in G. 

We could have used left cosets instead, and we see that |G|/|/f| is also the 
number of left cosets. So these numbers are the same. In fact, there is another 
reason for this: 

Exercise Show that the set of all inverses of the elements in the right coset Hg 
form the left coset g l H. So there is a bijection between the set of right cosets 
and the set of left cosets of H. 

In the example in the preceding section, we had a group 53 with a subgroup 
having three right cosets and three left cosets; that is, a subgroup with index 3. 

Corollary 3.16 let g be an element of the finite group G. Then the order of g 
divides the order ofG 

Proof Remember, first, that the word "order" here has two quite different mean- 
ings: the order of a group is the number of elements it has; while the order of an 
element is the smallest n such that g n = 1 . 

However, we also saw that if the element g has order m, then the set { 1 , g, g 2 , . . . , g m ~ 1 } 
is a cyclic subgroup of G having order m. So, by Lagrange's Theorem, m divides 
the order of G. 

Example Let G = S 3 . Then the order of G is 6. The element (1)(2,3) has 
order 2, while the element (1,3,2) has order 3. 

3.3 Homomorphisms and normal subgroups 

This section is similar to the corresponding one for rings. Homomorphisms are 
maps preserving the structure, while normal subgroups do the same job for groups 
as ideals do for rings: that is, they are kernels of homomorphisms. The structure 
of this section follows closely that of Section 2.3. 

3.3.1 Isomorphism 

Just as for rings, we say that groups are isomorphic if there is a bijection between 
them which preserves the algebraic structure. 

Formally, let G\ and G2 be groups. The map 6 : G\ — > G2 is an isomorphism 
if it is a bijection from G\ to G2 and satisfies 



{gh)6 = ( g e){he) for all g,heG h 
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Note that, as before, we write the map 9 to the right of its argument: that is, gO 
is the image of g under the map 9. If there is an isomorphism from G\ to G2, we 
say that the groups G\ and G\ are isomorphic. 

Example Let G\ be the additive group of Z/2Z, and let G2 be the symmetric 
group 52- Their Cay ley tables are: 



+ 





1 




1 


(1,2) 








1 


1 


1 


(1,2) 


1 


1 





(1,2) 


(1,2) 


1 



The map 9 that takes to 1, and 1 to (1,2), is clearly an isomorphism from G\ to 
G 2 . 

3.3.2 Homomorphisms 

An isomorphism between groups has two properties: it is a bijection; and it pre- 
serves the group operation. If we relax the first property but keep the second, we 
obtain a homomorphism. Just as for rings, we say that a function 9 : G\ — > G2 is 

• a homomorphism if it satisfies 

(gh)0 = ( g 0)(h0); (3.1) 

• a monomorphism if it satisfies (3.1) and is one-to-one; 

• an epimorphism if it satisfies (3.1) and is onto; 

• an isomorphism if it satisfies (3.1) and is one-to-one and onto. 

We have the following lemma, proved in much the same way as for rings: 

Lemma 3.17 Let 9 : G\ — > G2 be a homomorphism. Then \9 = 1; (g~ l )9 = 
(g9)- 1 ; and (gh- l )9 = (g9)(h9)-\ for all g,h E G h 

Now, if 9 : Gi — > G2 is a homomorphism, we define the image of 9 to be the 
subset 

{x E G2 : x = g9 for some g G Gi } 
of G2, and the kernel of 9 to be the subset 

{ g eG l :g6 = l} 

of Gi. 
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Proposition 3.18 Let : G\ — > G% be a homomorphism. 

(a) lm(0) is a subgroup 0/G2. 

(b) Ker(0) is a subgroup ofG\. 

Proof We use the Second Subgroup Test in each case. 

(a) Take x,y E lm(0), say x = gO and y = h0 for g,h E G\. Then xy -1 = 
(gh~ l )d E lm(0), by the Lemma. 

(b) Takeg,/zGKer(0). Then gO=hO = 1, so = = 1; so^ 1 e 
Ker(0). 

Example Look back to the Cayley table of the symmetric group 53 in Chapter 7. 
Colour the elements 1 , ( 1 , 2, 3) and (1,3,2) red, and the elements ( 1 , 2) , (2, 3) and 
(1,3) blue. We see that the Cayley table has the "simplified form" 





red 


blue 


red 


red 


blue 


blue 


blue 


red 



This is a group of order 2, and the map 6 taking 1, (1,2,3) and (1,3,2) to red 
and (1,2), (2,3) and (1,3) to blue is a homomorphism. Its kernel is the subgroup 

{1,(1,2,3), (1,3,2)}. 

3.3.3 Normal subgroups 

A normal subgroup is a special kind of subgroup of a group. Recall from the last 
chapter taht any subgroup H has right and left cosets, which may not be the same. 
We say that H is a normal subgroup of G if the right and left cosets of H in G are 
the same; that is, if Hx = xH for any x E G. 

There are several equivalent ways of saying the same thing. We define 

x l Hx = {x~ x hx :hEH} 

for any element x EG. 

Proposition 3.19 Let H be a subgroup ofG Then the following are equivalent: 

(a) H is a normal subgroup, that is, Hx = xH for all x E G; 

(b) x l Hx = H for all x E G; 

(c) x~ l hx E H, for all x EG and h E H. 
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Proof If Hx = xH, then x Hx = x xH = H, and conversely. So (a) and (b) are 
equivalent. 

If (b) holds then every element x~ l hx belongs to x ~ Hx, and so to H, so (c) 
holds. Conversely, suppose that (c) holds. Then every element of x l Hx belongs 
to H, and we have to prove the reverse inclusion. So take h EH. Putting y = x~ l , 
we have k = y~ l hy = xhx~ l E H, so h e x l Hx, finishing the proof. 

Now the important thing about normal subgroups is that, like ideals, they are 
kernels of homomorphisms. 

Proposition 3.20 Let 6 : Gi — > G2be a homomorphism. Then Ker(0) is a normal 
subgroup ofG\. 

Proof Let H = Ker(0). Suppose that h e H and x G G. Then 

( X - 1 hx)6 = ( X - l )0-h0 x0 = (xey 1 1x0 = 1, 

so x~ 1 hx E ker(0) = H. By part (c) of the preceding Proposition, H is a normal 
subgroup of G. 

There are a couple of situations in which we can guarantee that a subgroup is 
normal. 

Proposition 3.21 (a) IfGis Abelian, then every subgroup HofG is normal, 
(b) IfH has index 2 in G, then H is normal in G. 

Proof (a) If G is Abelian, then xH = Hx for all xeG. 

(b) Recall that this means that H has exactly two cosets (left or right) in G. 
One of these cosets is H itself; the other must consist of all the other elements of 
G, that is, G\H. This is the case whether we are looking at left or right cosets. 
So the left and right cosets are the same. 

Remark We saw in the last chapter an example of a group 53 with a non-normal 
subgroup having index 3 (that is, just three cosets). So we can't improve this 
theorem from 2 to 3. 

In our example in the last section, the subgroup {1,(1,2,3), (1,3,2)} of S3 has 
index 2, and so is normal, in 53; this also follows from the fact that it is the kernel 
of a homomorphism. 

For the record, here is a normal subgroup test: 

Proposition 3.22 (Normal subgroup test) A non-empty subset H of a group G is 
a normal subgroup of G if the following hold: 
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(a) for any h 7 k G H, we have hk G H; 

(b) for any h G H and x G G, we have x~ 1 hx G H. 

Proof (a) is the condition of the Second Subgroup Test, and we saw that (b) is a 
condition for a subgroup to be normal. 

3.3.4 Quotient groups 

Let H be a normal subgroup of a group G. We define the quotient group G/H as 
follows: 

• The elements of G/H are the cosets of H in G (left or right doesn't matter, 
since H is normal); 

• The group operation is defined by (Hx) (hy) = Hxy for all x,y G G; in other 
words, to multiply cosets, we multiply their representatives. 

Proposition 3.23 If H is a normal subgroup of G, then the quotient group G/H 
as defined above is a group. Moreover, the map 6 from G to G/H defined by 
xG = Hx is a homomorphism whose kernel is H and whose image is G/H. 

Proof First we have to show that the definition of the group operation is a good 
one. In other words, suppose that we chose different coset representatives x' and 
y' for the cosets Hx and Hy; is it true that Hxy = Hx'y'l We have x 1 = hx and 
/ = ky, for some h,k G H. Now xk belongs to the left coset xH. Since H is 
normal, this is equal to the right coset Hx, so that xk = Ix for some / G H. Then 
x'y' = hxky = (hi) (xy) G Hxy, since hi G H. Thus the operation is indeed well- 
defined. 

Now we have to verify the group axioms. 
(GO) Closure is clear since the product of two cosets is a coset. 
(Gl) Given three cosets Hx,Hy,Hz, we have 

((Hx)(Hy))(Hz) = (Hxy)(Hz)=H(xy)z = Hx(yz) = (Hx)(Hyz) = (Hx)((Hy)(Hz)), 
using the associative law in G. 
(G2) The identity is HI = H, since (Hl)(Hx) = H(lx) = Hx for all x G G. 
(G3) The inverse of Hx is clearly Hx 1 . 
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Finally, for the map 0, we have 

(xy)6 = Hxy = (Hx)(Hy) = (xO)(yO), 

so is a homomorphims. Its image consists of all cosets Hx, that is, lm(0) = 
G/H. The identity element of G/H is (as we saw in in the proof of (G2)) the 
coset H; and Hx = H if and only if x E H, so that Ker(0) = H. 

The map 9 in the above proof is called the natural homomorphism from G to 
G/H. We see that, if H is a normal subgroup of G, then it is the kernel of the 
natural homomorphism from G to G/H. So normal subgroups are the same thing 
as kernels of homomorphisms. 

Example Let G = S 3 , and let H be the subgroup { 1 , ( 1 , 2, 3) , ( 1 , 3, 2) }. We have 
observed that H is a normal subgroup. It has two cosets, namely Hi — H and 
H(l,2) = {(1,2), (2,3), (1,3)}. The rules for multiplication of these cosets will 
be the same as the rules for multiplying the elements 1 and (1,2). So G/H is 
isomorphic to the group { 1 , ( 1 , 2) } of order 2. 

3.3.5 The Isomorphism Theorems 

The Isomorphism Theorems for groups look just like the versions for rings. 

Theorem 3.24 (First Isomorphism Theorem) Let G\ and G2 be groups, and let 
6 : G\ — > G2 be a homomorphism. Then 

(a) lm(0) is a subgroup 0/G2; 

(b) Ker(0) is a normal subgroup of G\; 

(c) Gi/Ker(0)^lm(0). 

Proof We already proved the first two parts of this theorem. We have to prove (c). 
That is, we have to construct a bijection from G/N to lm(0), where = Ker(0), 
and prove that it preserves the group operation. 
The map § is defined by (Nx) = xO . We have 

(Nx)(j) = (Ny)(j) ^x9=y0^ {xy^ l )0 = 1 xy~ l E Ker(0) = N ^Nx = Ny, 

so is well-defined and one-to-one. It is clearly onto. Finally, 

(Nx)(j) ■ (Ny)(j) = (x8)(y8) = (xy)0 = (Nxy)(j) = ((Nx)(Ny))(j), 



so ^ preserves the group operation as required. 



3.3. HOMOMORPHISMS AND NORMAL SUBGROUPS 
The same picture as for rings may be useful: 



77 



Ker(0) • ! 




*- • 








K * 





G\ G2 



The parts on the left are the cosets ofN = Ker(0), where N itself is the topmost 
part. Each coset of N maps to a single element of lm(0), and the correspondence 
between cosets and elements of lm(0) is the bijection of the last part of the theo- 
rem. 

The other two theorems will be stated without proof. You are encouraged to 
try the proofs for yourself; they are very similar to the proofs for rings. 

Theorem 3.25 (Second Isomorphism Theorem) Let N be a normal subgroup of 
the group G. Then there is a one-to-one correspondence between the subgroups of 
G/N and the subgroups ofG containing N, given as follows: to a subgroup HofG 
containing N corresponds the subgroup H/NofG/N. Under this correspondence, 
normal subgroups of G/N correspond to normal subgroups of G containing N; 
and, ifM is a normal subgroup of G containing N, then 

(G/N)/(M/N) = G/M. 

For the next theorem we need to define, for any two subsets A, B of a group G, 
the set 

AB = {ab : a G A,b eB} 
of all products of an element of A by an element of B. 

Theorem 3.26 (Third Isomorphism Theorem) Let Gbe a group, H a subgroup 
ofG, andN a normal subgroup ofG. Then 

(a) HN is a subgroup ofG containing N; 

(b) HON is an normal subgroup ofH; 

(c) H/(HC]N) ^ (HN) /N. 

We end this section with one fact about groups which doesn't have an obvious 
analogue for rings. 
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Proposition 3.27 Let H and K be subgroups of the group G. 

(a) HK is a subgroup of G if and only if UK = KH. 

(b) IfK is a normal subgroup of G, then HK = KH. 

Proof (a) Suppose that HK = KH. Then every element of the form kh (for k E 
K and h E H) can also be expressed in the form h'k' (for h' E H and k' E K). 
Now we apply the subgroup test to HK. Take h\k\,hjk2 £ HK. We want to 
know if hxkxih'ik'i)^ 1 E HK. This expression is h\kik 2 h 2 . Now k\k 2 1 E K, 
so (k\k 2 ) n 2 1 e so we can wr ite this element as h'k', for some h' E H and 
£' G Then 

hikik 2 l h 2 l = (hh')k' E HK, 

as required. 

Conversely, suppose that HK is a subgroup. We have to show that HK = KH, 
that is, every element of one is in the other. Take any element x E HK. Then 
x~ l E HK, so x~ l = hk, for some h E H and k E K. Then x = k~ x hr x E KH. The 
reverse inclusion is proved similarly. 

(b) If K is a normal subgroup, then the Third Isomorphism Theorem shows 
that HK is a subgroup, so that HK = KH by Part (a). 



Exercise If H and K are subgroups of G, show that 

, \H\-\K\ 

HK 



HHK 



whether or not HK is a subgroup. [Hint: there are \H\ • \K\ choices of an expression 
hk. Show that every element in HK can be expressed as such a product in \H D K 
different ways.] 



Example Let G = S3, H = {1, (1,2)} and K = {1, (2,3)}. Then H and K are 
two subgroups of G, each of order 2, and H(~)K = {1}, so \HK\ = 4. Since 4 
doesn't divide 6, Lagrange's Theorem shows that HK cannot be a subgroup of G. 
This shows, once again, that H and K are not normal subgroups of G. 



3.3.6 Conjugacy 

Conjugacy is another equivalence relation on a group which is related to the idea 
of normal subgroups. 

Let G be a group, we say that two elements g,hEG are conjugate ifh = x~ l gx 
for some element x E G. 
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Proposition 3.28 (a) Conjugacy is an equivalence relation on G. 

(b) A subgroup H of G is normal if and only if it is a union of (some of the) 
conjugacy classes in G. 

Proof (a) Write g ~ h to mean that h = x~ 1 gx for some x E G. Then 

• g = \~ 1 gl, so g ~ g: ~ is reflexive. 

• If h =x~ l gx, then g = (x~ l )~ l h(x~ 1 ): so ~ is symmetric. 

• Suppose that g ~ h and h ~ k. Then h = x~ l gx and k = y~ l hy for some x, y. 
Then k = y~ l x~ l gxy = (xy)~ l g(xy), so g ~ k: ~ is transitive. 

(b) The condition that H is a union of conjugacy classes means that, if h e H, 
then any element conjugate to h is also in H. We saw in Proposition 3.19 that this 
condition is equivalent to normality of H. 

Exercise Let G — S3. Show that the conjugacy classes in G are {1}, {(1,2, 3), (1, 3,2)}, 
and {(1,2), (2,3), (1,3)}. (We will look at conjugacy in symmetric groups in the 
next section.) 

3.4 Symmetric groups and Cayley's Theorem 

Cayley's Theorem is one of the reasons why the symmetric groups form such 
an important class of groups: in a sense, if we understand the symmetric groups 
completely, then we understand all groups! 

Theorem 3.29 Every group is isomorphic to a subgroup of some symmetric group. 

Before we give the proof, here is a small digression on the background. Group 
theory began in the late 18th century: Lagrange, for example, proved his theorem 
in 1770. Probably the first person to write down the axioms for a group in anything 
like the present form was Dyck in 1882. So what exactly were group theorists 
doing for a hundred years? 

The answer is that Lagrange, Galois, etc. regarded a group as a set G of per- 
mutations with the properties 

• G is closed under composition; 

• G contains the identity permutation; 

• G contains the inverse of each of its elements. 

In other words, G is a subgroup of the symmetric group. 
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Thus, Cayley's contribution was to show that 
every group (in the modern sense) could be 
regarded as a group of permutations; that is, 
every structure which satisfies the group 
axioms can indeed be thought of as a group 
in the sense that Lagrange and others would 
have understood. 
In general, systems of axioms in 
mathematics are usually not invented out of 
the blue, but are an attempt to capture some 
theory which already exists. 

3.4.1 Proof of Cayley's Theorem 

We begin with an example. Here is the Cay ley table of a group we have met 
earlier: it is C2 x Ci, or the additive group of the field of four elements. When 
we saw it in Chapter 7, its elements were called e,a,b,c; now I will call them 

gl»g2,g3»g4- 





81 


82 


83 


84 


81 


81 


82 


83 


84 


82 


82 


81 


84 


83 


83 


83 


84 


81 


82 


84 


84 


83 


82 


81 



Now consider the four columns of this table. In each column, we see the four 
group elements g\,...,g4, each occurring once; so their subscripts form a permu- 
tation of { 1 , 2, 3, 4}. Let %i be the permutation which is given by the ith column. 
For example, for i = 3, the elements of the column are (g3,g4 1 gi,g 2 ), an d so 

( \ 2 3 4\ 

713 is the permutation which is I ^ ^ ^ ^ J m two-line notation, or ( 1 , 3) (2, 4) 

in cycle notation. 

The four permutations which arise in this way are: 

K\ = 1 

K 2 = (1,2)(3,4) 
7r 3 = (1,3)(2,4) 
%4 = (1,4)(2,3) 

Now we claim that {^1,^2,^3,^4} is a subgroup H of the symmetric group 
S4, and that the map defined by gj6 = %i is an isomorphism from G to H. (This 
means that, if gigj = g^, then = Ttk, where permutations are composed in the 
usual way.) This can be verified with a small amount of checking. 
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You might think that it would be easier to use rows instead of columns in this 
argument. In the case of an Abelian group, like the one in this example, of course 
it makes no difference since the Cay ley table is symmetric; but for non- Abelian 
groups, the statement would not be correct for rows. 

So now we come to a more precise statement of Cayley's Theorem. We assume 
here that the group is finite; but the argument works just as well for infinite groups 
too. 

Theorem 3.30 (Cayley's Theorem) Let G = {g\, . . . ,g n } be a finite group. For 
j E { 1 , . . . , n}, let Ttj be the function from { 1 , . . . , n} to itself defined by the rule 

iKj = k if and only if gig j = g k . 

Then 

(a) Ttj is a permutation of{l,...,n}; 

(b) the set H = {tci, . . . , Tt n } is a subgroup ofS n ; 

(c) the map 6 : G — > S n given by gjO = TCj is a homomorphism with kernel {1} 
and image H; 

(d) G is isomorphic to H. 

Proof (a) To show that Ttj is a permutation, it is enough to show that it is one- 
to-one, since a one-to-one function on a finite set is onto. (For infinite groups, 
we would also have to prove that it is onto.) So suppose that i\%j = iiTtj = k. 
This means, by definition, that g i{ gj = g i2 gj = g k . Then by the cancellation law, 

gh = gi 2 = gkgj\ and so h = i 2 . 

(c) Clearly the image of 6 is H. So if we can show that 6 is a homomorphism, 
the fact that H is a subgroup of S n will follow. 

Suppose that gjg^ = g[. We have to show that TZjKk — 7C[, in other words (since 
these are functions) that {iKj)Kk = iiti for any / e { 1 , . . . , n}. Define numbers p, q, r 
by gigj = g P , gpgk = gq, and gigi = g r . Then iitj = p, pn k = q (so i%j% k = q), and 
ITT/ = r. So we have to prove that q = r. But 

gq = gpgk = (gigj)gk = gi(gjgk) = gigl = gn 

so q = r. 

Now the kernel of 6 is the set of elements gj for which Ttj is the identity 
permutation. Suppose that gj E Ker(0). Then iKj = i for any i E {1,. . . ,n}. This 
means that gigj = gi, so (by the cancellation law) gj is the identity. So Ker(0) = 
{!}. 
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Now the First Isomorphism Theorem shows that H = lm(0) is a subgroup of 
S n (that is, (b) holds), and that 

G^G/{1} = G/Ker(e) =H, 

that is, (d) holds. So we are done. 

Remark There may be other ways to find a subgroup of S n isomorphic to the 
given group G. For example, consider the group G = S3, a non-abelian group 
of order 6, whose Cay ley table we wrote down in Chapter 7. From this table, 
you could find a set of six permutations in the symmetric group which form a 
subgroup of S(, isomorphic to G. But G is already a subgroup of 53 ! 

3.4.2 Conjugacy in symmetric groups 

We finish this chapter with two more topics involving symmetric groups. First, 
how do we tell whether two elements of S n are conjugate? 

We define the cycle structure of a permutation g E S n to be the list of cycle 
lengths of g when it is expressed in cycle notation. (We include all cycles includ- 
ing those of length 1 .) The order of the terms in the list is not important. Thus, for 
example, the permutation (1,7) (2, 6, 5) (3, 8, 4) has cycle structure [2, 3, 3] . 

Proposition 3.31 Two permutations in S n are conjugate in S n if and only if they 
have the same cycle structure. 

Proof Suppose that (a\ ,02, . . . , a r ) is a cycle of a permutation g. This means that 
g maps 

a\ 1— > ai 1— > . . . 1— > a r 1— > a\ . 
We claim that h = x~ 1 gx maps 

a\x I— > a2X I— > ... 1— > a r x 1— > a\x, 

so that it has a cycle (ai,a2, . . . ,a r ). This is true because 

[aix)h — [aiX){xT l gx) = aigx = at + \x 

for i = I,... ,r—l, and (a r x)h = a\x. 

This shows that conjugate elements have the same cycle structure. The recipe 
is: given g in cycle notation, replace each element at in each cycle by its image 
under x to obtain x~ 1 gx in cycle notation. 

Conversely, suppose that g and h have the same cycle structure. We can write h 
under g so that cycles correspond. Then the permutation x which takes each point 
in a cycle of g to the corresponding point in a cycle of h is the one we require, 
satisfying h = x~ l gx, to show that g and h are conjugate. 
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Example The permutations g= (1,7) (2, 6, 5) (3, 8,4) andh = (2, 3) (1,5, 4) (6, 8, 7) 
are conjugate. We can take x to be the permutation given by 
in two-line notation, or (1,2) (3, 6, 5, 4, 7) (8) in cycle notation. 

3.4.3 The alternating groups 

You have probably met the sign of a permutation in linear algebra; it is used in the 
formula for the determinant. It is important in group theory too. 

Let g be an element of S n , which has k cycles when written in cycle notation 
(including cycles of length 1). We define its sign to be ( — \) n ~ k . 

Note that the sign depends only on the cycle structure; so conjugate permuta- 
tions have the same sign. 

Theorem 3.32 The function sgn is a homomorphismfrom the symmetric group S n 
to the multiplicative group {+1,-1}. For n>2, it is onto; so its kernel (the set 
of permutations of {I, . . . ,n} with sign +) is a normal subgroup of index 2 in S n , 
called the alternating group A n . 

Example For n = 3, the permutations with sign +1 are 1, (1,2,3) and (1,3,2), 
while those with sign —1 are (1,2), (2,3) and (1,3). We have seen that the first 
three form a normal subgroup. 

Proof We define a transposition to be a permutation of {1, . . . ,n} which inter- 
changes two points i and j and fixes all the rest. Now a transposition has cycle 
structure [2, 1,1,..., 1], and so has n — 1 cycles; so its sign is ( — l) 1 = — 1. 
We show the following two facts: 

(a) Every permutation can be written as the product of transpositions. 

(b) If t is a transposition and g any permutation, then sgn(gt) = — sgn(g). 

Now the homomorphism property follows. For take any g,h E S n . Write h = 
t\t2 ...t r , where t\ , . . . ,t r are transpositions. Then applying (b) r times, we see that 
sgn(gh) = sgn(g)( — l) r . But also sgn(/z) = (— l) r (using the identity instead of g), 
so sgn(gh) = sgn(g) sgn(/z). Thus sgn is a homomorphism. Since sgn(l,2) = — 1, 
we see that Im(sgn) = {+1,-1}. So, if A n denotes Ker(sgn), the First Isomor- 
phism Theorem shows that S n /A n = {±1}, so that A n has two cosets in S n (that is, 
index 2). 

Proof of (a): Take any permutation g, and write it in cycle notation, as a prod- 
uct of disjoint cycles. It is enough to show that each cycle can be written as a 
product of transpositions. Check that 

(fll,fl2,---,«r) = («1, «2) («1,«3) •••(«! ,d r )- 



(1 7 2 6 5 3 8 4\ 
12 3 154687) 
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Proof of (b): Again, write g in cycle notation. Now check that, if t interchanges 
points in different cycles of g, then in the product gt these two cycles are "stitched 
together" into a single cycle; while, if t interchanges points in the same g-cycle, 
then this cycle splits into two in gt. For example, 



So multiplying g by a transposition changes the number of cycles by one (either 
increases or decreases), and so multiplies the sign by —1. 

The proof shows another interesting fact about permutations. As we saw, every 
permutation can be written as a product of transpositions. 

Corollary 3.33 Given any two expressions of g G S n as a product of transposi- 
tions, the numbers of transpositions used have the same parity, which is even if 
sgn(g) = +1 and odd if sgn(g) = — 1. 

Proof We saw that if g is the product of r transpositions, then sgn(g) = (— \) r . 
This must be the same for any other expression for g as a product of transpositions. 

Example (1,2) = (1,3) (2, 3) (1,3); one expression uses one transposition, the 
other uses three. 

3.5 Some special groups 

In the final section we make the acquaintance of some further types of groups, and 
investigate more closely the groups 54 and S5. 

3.5.1 Normal subgroups of S4 and S$ 

In this section, we find all the normal subgroups of the groups 54 and S5 . There 
are two possible approaches we could take. We could find all the subgroups and 
check which ones on our list are normal. But, for example, S5 has 156 subgroups, 
so this would be quite a big job! The approach we will take is based on the fact 
that a subgroup of G is normal if and only if it is a union of conjugacy classes. 
So we find the conjugacy classes in each of these groups, and then figure out how 
to glue some of them together to form a subgroup (which will automatically be 
normal). 



(l,3,5,7)(2,4,6)-(3,4) 
(1,2,5,3,8,6,4,7) -(3,4) 



(1,4,6,2,3,5,7), 
(1,2,5,4,7)(3,8,6). 
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Recall from the last chapter that two permutations in S n axe conjugate if and 
only if they have the same cycle structure. So first we list the possible cycle struc- 
tures and count the permutations with each structure. For 54, we get the following 
table. (We list the sign in the last column; we know that all the permutations with 
sign +1 must form a normal subgroup. The sign is of course ( — \) n ~ k , where k is 
the number of cycles.) 



Cycle structure 


Number 


Sign 


[1,1,1,1] 


1 


+ 


[2,1,1] 


6 




[2,2] 


3 


+ 


[3,1] 


8 


+ 


[4] 


6 




Total 


24 





How do we compute these numbers? There is a general formula for the num- 
ber of permutations with given cycle structure. If you want to use it to check, here 
it is. Suppose that the cycle structure is [ai, a2, . . . , a r ], and suppose that in this list 
the number 1 occurs m\ times, the number 2 occurs mi times, and so on. Then the 
number of permutations with this cycle structure is 

n\ 

\mi mi \2 m 2m 2 \ 

So for example, for the cycle structure [2,2], we have two 2s and nothing else, so 
ni2 = 2, and the number of permutations with cycle structure [2, 2] is 4!/(2 2 2!) = 
3. 

In small cases we can argue directly. There is only one permutation with 
cycle structure [1,1,1,1], namely the identity. Cycle structure [2, 1, 1] describes 
transpositions, and there are six of these (the number of choices of the two points 
to be transposed). For cycle structure [2,2] we observe that the six transpositions 
fall into three complementary pairs, so there are three such elements. For [3, 1], 
there are four choices of which point is fixed, and two choices of a 3-cycle on the 
remaining points. Finally, for [4], a 4-cycle can start at any point, so we might 
as well assume that the first point is 1. Then there are 3! = 6 ways to put the 
remaining points into a bracket (1, , , ) to make a cycle. 

Having produced this table, how can we pick some of the conjugacy classes to 
form a subgroup? We know that a subgroup must contain the identity, so the first 
class must be included. Also, by Lagrange's Theorem, the order of any subgroup 
must divide the order of the group. So, unless we take all five classes, we cannot 
include a class of size 6. (For then the order of the subgroup would be at least 7, so 
necessarily 8 or 12, and we cannot make up 1 or 5 elements out of the remaining 
classes.) So the only possibilities are: 
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• Take just the class {1}. This gives the trivial subgroup, which is certainly 
normal. 

• Take {1} together with the class [2,2], giving four elements. We have to 
look further at this. 

• Take {1} together with the classes [2,2] and [1,3]. These are all the even 
permutations, so do form a normal subgroup, namely the alternating group 
A 4 . 

• Take all five classes. This gives the whole of 54, which is a normal subgroup 
of itself. 

The one case still in doubt is the possibility that the set 

y 4 = {l,(l,2)(3,4),(l,3)(2,4),(l,4)(2,3)} 

is a normal subgroup. Of course, if it is a subgroup, then it is normal, since it 
consists of two conjugacy classes. And it is a subgroup; our example of Cayley's 
Theorem produced precisely this subgroup! It is called V from the German word 
vier, meaning "four"; it is sometimes referred to as the "four-group". 
We have proved: 

Proposition 3.34 The group S4 has four normal subgroups. These are the iden- 
tity, the four- group V4, the alternating group A4, and the symmetric group S4. 

What about the factor groups? Clearly 5 4 /{l} = S 4 , while S4/S4 = {1}. We 
know that S4/A4 is isomorphic to the multiplicative group {±1}, which is a cyclic 
group of order 2. One case remains: 

Proposition 3.35 S4/V4 = S 3 . 

Proof There are many ways to see this. Here is the simplest. 

Consider the subgroup 53 of S4 consisting of all permutations fixing the point 
4. We have \S 3 \ = 6, | V4 1 = 4, and 53 n V4 = {1} (by inspection of the elements of 
V4), so IS3V4I = 24; that is, S3V4 = S4. Now, by the Third Isomorphism Theorem, 

S4/V4 = s 3 v 4 /v 4 = s 3 /(s 3 nv 4 ) = s 3 /{\} = s 3 . 

We now look at S5 and show: 

Proposition 3.36 The group S5 has three normal subgroups: the identity, the al- 
ternating group A5, and the symmetric group S5. 
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Cycle structure 


Number 


Sign 


[1,1,1,1,1] 


1 


+ 


[2,1,1,1] 


10 




[2,2,1] 


15 


+ 


[3,1,1] 


20 


+ 


[3,2] 


20 




[4,1] 


30 




[5] 


24 


+ 


Total 


120 





Number these classes as C\ , . . . , Cq in order. We have to choose some of them 
including C\ such that the sum of the numbers is a divisor of 120. All classes 
except C\ and Cj have size divisible by 5; so if we don't include Cq then the total 
divides 24, which is easily seen to be impossible. So we must have Cj. Now, 
since we are trying to build a subgroup, it must be closed under composition; so 
any cycle type which can be obtained by multiplying together two 5-cycles has to 
be included. Since 

(1,2,3,4,5)(1, 5,4,2,3) = (1,3,2), 
(1,2,3,4,5)(1, 2,3,5,4) = (1,3)(2,5), 

both classes C3 and C4 must be included. Now C\ U C3 U C4 U Cj is the alternating 
group A5, and if there is anything else, we must have the entire symmetric group. 



3.5.2 Dihedral groups 



One important source of groups is as symmetries of geometric figures. Here is 
an example. Consider a square, as shown in the figure. (We have marked various 
axes of symmetry as dotted lines.) 

B 




D 



C 



Now the square has eight symmetries, four rotations and four reflections. They 
are given in the table, together with their effect as permutations of the four vertices 
of the square. The rotations are taken clockwise. 
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Symmetry 


Permutation 


Identity 


1 


Rotation through 90° about 


(1,2,3,4) 


Rotation through 180° about 


(1,3)(2,4) 


Rotation through 270° about 


(1,4,3,2) 


Reflection about AC 


(1,2)(3,4) 


Reflection about BD 


(M)(2,3) 


Reflection about 13 


(2,4) 


Reflection about 24 


(1,3) 



The eight symmetries form a group (with the operation of composition). The 
corresponding permutations form a group, a subgroup of the symmetric group 
S4, which is isomorphic to the group of symmetries. This group is non-Abelian. 
(This can be seen by composing symmetries, or by composing permutations. For 
example, (1,2,3,4)(1,3) = (1,2)(3,4), while (1,3)(1, 2,3,4) = (1,4)(2,3).) 

More generally, a regular n-gon has 2n symmetries, n rotations and n reflec- 
tions, forming a group which is known as the dihedral group D2 n - (Thus the group 
in the table above is D%. You should be warned that some people refer to what I 
have called as simply D n .) 

Here are some properties of dihedral groups, which should be clear from the 
figure. 

• The n rotations form a cyclic subgroup C n of Z^. (This subgroup has in- 
dex 2 in D2n, so it is a normal subgroup.) 

• If n is odd, then every reflection axis joins a vertex to the midpoint of the op- 
posite side; while if n is even, then n/2 axes join the midpoints of opposite 
sides and n/2 join opposite vertices. 

• Any reflection has order 2. 

• If a is a rotation and b a reflection, then bob = a~ l . 

The last condition says: reflect the figure, rotate it clockwise, and then reflect 
again; the result is an anticlockwise rotation. This gives another proof that the 
rotations form a normal subgroup. For let a be a rotation, and x any element. If x 
is a rotation, then ax = xa, so x~ l ax = a. If x is a reflection, then x~ l = x, and so 
x ax = aT x . So any conjugate of a rotation is a rotation. 

The definition of D4 is not clear, since there is no regular 2-gon. But if we take 
the pattern in Din an d apply it for n = 2, we would expect a group with a cyclic 
subgroup of order 2, and all elements outside this subgroup having order 2. This 
is a description of the four-group. 
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Moreover, is the group of symmetries of an equilateral triangle, and the six 
permutations of the vertices comprise all possible permutations. 
So the following is true. 

Proposition 3.37 (a) The dihedral group D4 is isomorphic to the four-group 
V 4 . 

(b) The dihedral group D§ is isomorphic to the symmetric group S3. 

3.5.3 Small groups 

How many different groups of order n are there? (Here, "different" means "non- 
isomorphic".) This is a hard question, and the answer is not known in general: it 
was only six years ago that the number of groups of order 1024 was computed: 
the number is 49, 487, 365, 422. (The result of this computation was announced at 
Queen Mary.) 

We will not be so ambitious. For small n, the number of groups of order n is 
given in this table. We will verify the table up to n = 1. 



Order 


1 2 3 4 5 6 7 8 


Number of groups 


1112 12 15 



Clearly there is only one group of order 1 . The result for n = 2, 3, 5, 7 follows 
from the next Proposition. 

Proposition 3.38 A group of prime order p is isomorphic to the cyclic group C p . 

Proof Take an element g of such a group G, other than the identity. By La- 
grange's Theorem, the order of g must divide p, and so the order must be 1 or p 
(since p is prime). But g 7^ 1, so its order is not 1; thus it is p. So G is a cyclic 
group generated by g. 

Next we show that there are just two groups of order 4. We know two such 
groups already: the cyclic group C4, and the four-group V4. (See the Cayley tables 
in 3.1.2 of the notes). 

Let G be a group of order 4. Then the order of any element of G divides 4, 
and so is 1, 2 or 4 by Lagrange's Theorem. If there is an element of order 4, then 
G is cyclic; so suppose not. Then we can take G = {l,a,b,c}, where a 2 = b 2 = 
c 2 = 1. What is ab! It cannot be 1, since ab = 1 = a 2 would imply a = b by 
the Cancellation Law. Similarly ab ^ a and ab 7^ b, also by the Cancellation Law 
(these would imply b = 1 or a = 1 respectively). So ab = c. Similarly, all the 
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other products are determined: the product of any two non-identity elements is 
the third. In other words, the Cayley table is 





1 


a 


b 


c 


1 


1 


a 


b 


c 


a 


a 


1 


c 


b 


b 


b 


c 


1 


a 


c 


c 


b 


a 


1 



We recognise the Klein four-group. So there is just one such group (up to isomor- 
phism), giving two altogether. 

To deal with order 6, we need a couple of preliminary results. 

Proposition 3.39 A group of even order must contain an element of order 2. 

Proof Take any group G of order n, and attempt to pair up the elements of G with 
their inverses. Suppose that we can form m pairs, accounting for 2m elements. The 
elements we have failed to pair up are the ones which satisfy g = g^ 1 , or g 2 = 1; 
these include the identity (one element) and all the elements of order 2. So there 
must be n — 2m — 1 elements of order 2. If n is even, then n — 2m — 1 is odd, and 
so cannot be zero; so there is an element of order 2 in G. 

Proposition 3.40 A finite group in which every element has order 2 (apart from 
the identity) is Abelian. 

Proof Take any g,h EG. We have 

(gh) 2 = ghgh = 1, 
g 2 h 2 = gghh = 1, 

so by cancellation, hg = gh. Thus G is Abelian. 

Now let G be a group of order 6. If G contains an element of order 6, then it 
is cyclic; so suppose not. Now all its elements except the identity have order 2 or 
3. The first proposition above shows that G contains an element a of order 2. The 
second shows that it must also have an element of order 3. For, suppose not. Then 
all non-identity elements of G have order 2. If g and g are two such elements, 
then it is easy to see that {\,g,h,gh — hg] is a subgroup of order 4, contradicting 
Lagrange's Theorem. 

So let a be an element of order 3 and an element b of order 2. The cyclic 
subgroup (a) = {l,a,a = a 2 } of G has order 3 and index 2, so it is normal. So 
b~ l ab E (a), whence b~ l ab = a or b~ x ab = a -1 . 
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If b l ab = a, then ab = ba, so (ab) 1 = a l b l for all i. Then the powers of ab are 

(ab) 2 = a 2 b 2 = a 2 , (ab) 3 = a 3 b 3 = b, (ab) 4 = a 4 b 4 = a, 
(ab) 5 = a 5 b 5 = a 2 b, (ab) 6 = a 6 b 6 = 1, 

so the order of ab is 6, contradicting our case assumption. So we must have 
b~ l ab = aT x , or ba = aT l b = a 2 b. 

Now using this, the Cayley table of G is completely determined: all the el- 
ements have the form a l b J \ where i = 0,1,2 and j = 0, 1; to multiply a l V by 
we use the condition ba = a l b to jump the first b over the as to its right if 
necessary and the conditions a 3 = b 2 = 1 to reduce the exponents. For example, 

a b-ab = a (ba)b = a (a b)b = a b = a. 

So there is only one possible group of this type. Its Cayley table is: 





1 


a 


a 2 


b 


ab 


a 2 b 


1 


1 


a 


a 2 


b 


ab 


a 2 b 


a 


a 


a 2 


1 


ab 


a 2 b 


b 


a 2 


a 2 


1 


a 


a 2 b 


b 


ab 


b 


b 


a 2 b 


ab 


1 


a 2 


a 


ab 


ab 


b 


a 2 b 


a 


1 


a 2 


a 2 b 


a 2 b 


ab 


b 


a 2 


a 


1 



These relations are satisfied in the group 53, if we take a = (1,2,3) and b = 
(1,2). So there is such a group; and so there are two groups of order 6 altogether 
(the other being the cyclic group). Alternatively, we could observe that the above 
relations characterise the dihedral group so the two groups are and Dg. 

We see, incidentally, that 53 = D 6 , and that this is the smallest non-Abelian 
group. 

3.5.4 Polyhedral groups 

You have probably seen models of the five famous regular polyhedra: the tetra- 
hedron, the cube (or hexahedron), the octahedron, the dodecahedron, and the 
icosahedron. These beautiful figures have been known since antiquity. See Fig- 
ures 3.1, 3.2. 

What are their symmetry groups? 

Here I will just consider the groups of rotations; the extra symmetries realised 
by reflections in three-dimensional space make the situation a bit more compli- 
cated. As in the case of the dihedral groups, these groups can be realised as 
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Figure 3.1: Tetrahedron, cube, and octahedron 




Figure 3.2: Dodecahedron and Icosahedron 

permutation groups, by numbering the vertices and reading off the permutation 
induced by any symmetry. 

Moreover, there are five figures, but only three groups. Apart from the tetra- 
hedron, the figures fall into "dual pairs": the figure whose vertices are the face 
centres of a cube is an octahedron and vice versa, and a similar relation holds be- 
tween the dodecahedron and the icosahedron. Dual pairs have the same symmetry 
group. (The face centres of the tetrahedron are the vertices of another tetrahedron, 
so this figure is "self-dual".) The following result describes the three symmetry 
groups. 

Proposition 3.41 (a) The tetrahedral group is isomorphic to A4. 

(b) The octahedral group is isomorphic to S4. 

(c) The icosahedral group is isomorphic to A5. 

Proof I will outline the proof. First we compute the orders of these groups. If 
a figure has m faces, each a regular polygon with n sides, then the number of 
rotational symmetries is mn. For imagine the figure with one face on the table. I 
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can pick it up and rotate it so that any of the m faces is on the table, in any of the 
n possible orientations. Now we have 

• Tetrahedron: m = 4, n = 3, group order 12. 

• Cube: m = 6, n = 4, group order 24. 

• Octahedron: m = 8, n = 3, group order 24. 

• Dodecahedron: m = 12, n = 5, group order 60. 

• Icosahedron: m = 20, n = 3, group order 60. 

We see that the symmetry groups of dual polyhedra have the same order, as they 
should. 

(a) Any symmetry of the tetrahedron permutes the four vertices. So the sym- 
metry group is a subgroup of 54 of order 12. To see that it is A4, we simply have 
to observe that every symmetry is an even permutation of the vertices. A rotation 
about the line joining a vertex to the midpoint of the opposite face has cycle struc- 
ture [3, 1], while a rotation about the line joining the midpoints of opposite edges 
has cycle structure [2,2]. (Alternatively, a subgroup of 54 of order 12 has index 2 
and is therefore normal; and we have worked out the normal subgroups of 54. The 
only one of order 12 is A4.) 

(b) Consider the cube. It has four diagonals joining opposite vertices. Any 
symmetry induces a permutation of the four diagonals. It is not hard to see that the 
map from symmetries to permutations is one-to-one. So the group is isomorphic 
to a subgroup of S4 of order 24, necessarily the whole of S4. 

(c) This is the hardest to prove. But in fact it is possible to embed a cube so that 
its vertices are eight of the 20 vertices of the dodecahedron in five different ways. 
These five inscribed cubes are permuted by any rotation, so we have a subgroup 
of S5 of order 60. This subgroup has index 2 and so is normal; so it must be A5. 
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